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TITLE OF THE INVENTION 

ASSAYS FOR NUCLEAR RECEPTOR UGANDS USING FRET 



CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional 
Application No. 60/061,385, filed 10/7/97, the contents of which are 
incorporated herein by reference in their entirely. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not applicable. 

REFERENCE TO MICROFICHE APPENDIX 
Not applicable. 

FIELD OF THE INVENTION 

This invention relates to methods of identifying novel - 
agonists and antagonists of nuclear receptors utilizing the agonist- 
dependent interaction of such receptors with CREB-binding protein 
(CBP) or other nuclear receptor co-activators in which this interaction is 
detected by fluorescence resonance energy transfer. 

BACKGROUND OF THE INVENTION 

Nuclear receptors are a superfamily of Hgand-activated 
transcription factors that bind as homodimers or heterodimers to their 
cognate DNA elements in gene promoters. The superfamily, with more 
than 150 members, can be divided into subfamilies (e.g. the steroid, 
retinoid, thyroid hormone, and peroxisome proliferator-activated 
[PPAR] subfamilies). Each subfamily may consist of several members 
which are encoded by individual genes (e.g. PPARa, PPARy, and 
PPAR5). In addition, alternative mRNA splicing can result in more 
than one isoform of these genes as in the case of specific PPARs (e.g. 
PPARyl and PPARy2). The nuclear receptor superfamily is involved in 
a wide variety of physiological functions in mammalian cells: e.g., 
differentiation, proliferation, and metabolic homeostasis. Dysfunction 
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ordered expression „f specific Dndear reKptm 
involved in disease pathogenesis. 

The PPAR subfamily of nuclear receptors consists of »h™ 
members: PPARa, PPARy, and PPAR5. PPAKa is hiXe^s^ 
5 bve, and fadney. Action of PPA*, by peroxisome pJnf^, m 
tadudrng hypolipidwc ^ M 

long-chmn tatty acids is responsible for the induction of acyl" 
oxidase and hytatas^ehydrogenase (endues required Ir 

J*! 'f " a ^™ 1 Thus.PPAR.xhasanimpZnt 

enects. PPARyis predominantly expressed in adipose tissue Eecentlv 
15 been identified as a potential physiological ligand 15 

expression of PPARtf m fibroblasts induced adipocyte differentiation, 
demons^ ^ role of PPARy in adipocyte dnCtiation^d 
storage. The demonstration that anti-diabetic and hpid-Weriag 
20 ^ sensitizing expounds known as Oi^aaJLS^tlA 
an^ hgandsforPPARysuggesteabr.ad therapeutic ^e fcS, 
ugand, » the treatment of diabetes and disorders associated vn* 
.nsuUn instance fe,. obesity and cardiovascular diseaseT 
Nuclear "esPlor proteins contain a central nui 

MD . imposed of two highly conserved zinc fingers *hat target the 
Receptor to specific promoter/enhancer DNA sequences knownas 

aXriZr >e j el T ,te(HBES) TO °^^t200^amm„ 
aads m length and is less well conserved than the DBD. There are at 

^acr^r fw * eLBD:d ^ a ^^^.^a 

^nsa^ation. The transaction function can be viewed a, a 
molecular switch between a tianscriptionaUy inactive and a 
tansenphonauy active state of the receptor. Binding of a ligand which 

uomain (Ar2) that is required for the switch. 
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The ligand-induced nuclear receptor mol cular switch is 
mediated through interactions with members of a family of nuclear 
receptor co-activators (e.g., CBP/p300, SRC-l/NcoA-1, TIF2/GRIP- 
l/NcoA-2, and p/CIP). Upon binding of agonist to its cognate receptor 
LBD, a conformational change in the receptor protein creates a co- 
activator binding surface and results in recruitment of co-activator(s) to 
the receptor and subsequent transcriptional activation. The binding of 
antagonist ligands to nuclear receptors will not induce the required 
conformational change and prevents recruitment of co-activator and 
subsequent induction of transcription. The co-activators CREB-binding 
protein (CBP) and p300 are two closely related proteins that were 
originally discovered by virtue of their ability to interact with the 
transcription factor CREB. These two proteins share extensive amino 
acid sequence homology. CBP can form a bridge between nuclear 
receptors and the basic transcriptional machinery (Kamei et al., 1996, 
Cell 85:403-414; Chakravarti et al., 1996, Nature 383:99-103; Hanstein et 
al., 1996, Proc. Natl. Acad. Sci. USA 93:11540-11545; Heery et al., 1997, 
Nature 387:733-736). CBP also contains intrinsic histone 
acetyltransferase activity which could result in local chromatin 
rearrangement and further activation of transcription. Ligand- and 
AF2-dependent interaction between certain nuclear receptors and CBP 
has been demonstrated in in vitro pull down assays and far-western 
assays. This interaction is both necessary and sufficient for the 
transcriptional activation that is mediated by these nuclear receptors. 
Thus, an AF2 mutant of the estrogen receptor (ER) which abolishes the 
transcriptonal function of the receptor is incapable of interacting with 
CBP. 

The N-termini of CBP and p300 have been shown to interact 
with the ligand-binding domains of some nuclear receptors (Kamei et 
al., 1996, Cell 85:403-414, hereinafter "Kamei"). Kamei was able to 
demonstrate direct interaction of CBP and p300 with nuclear receptors 
by several different methods: 

(1) Kamei produced GST fusion proteins of the first 100 
amino acids of the N-terminus of CBP. These fusion proteins were run 
out on a polyacrylamide gel, transferred to a membrane, and the 
membrane was exposed to 32p-l a beled ligand-binding domains of 
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nuclear receptors. In th pres nee of ligand, a specific binding 
interaction between the CBP and nuclear receptor fragments was 
detected in that the 32 P .i abe led Hgand-binding domains were observed to 
bind to the bands on the membrane containing the GST-CBP fusion 
proteins. 

(2) Kamei also utilized the yeast two-hybrid system. The 
Hgand-binding domain of the nuclear receptor fused to the DNA-binding 
domain of the LexA protein was used as bait. The amino terminal 
domain of CBP fused to the gal4 transactivation domain was used as 
prey. In the presence of hgand, a specific binding interaction (occurring 
m vwo, i.e. y within the yeast) was observed between the CBP and nuclear 
receptor fragments. 

(3) Kamei observed hgand-induced binding between CBP 
and nuclear receptors via a gel-shift assay. This assay is based on the 
observation that, in the presence of ligand, nuclear receptors will bind to 
oligonucleotides containing their target recognition sequence. Such 
binding results in the formation of a nuclear receptor-Kgand- 
oligonucleotide complex having a higher molecular weight than the 
oligonucleotide alone. This difference in molecular weight is detected 

20 via a shift in position of the 32 P .i ab eled oligonucleotide when it is run out 
on a polyacrylamide gel. Kamei found that a fragment of CBP (the N- 
terminal 100 amino acids) was capable of binding to the nuclear 
receptor-Kgand-oUgonucleotide complex and shifting the complex's 
position on the gel to an even higher molecular weight. 

25 (4) Kamei was able to co-immunoprecipitate CBP using 

antibodies to nuclear receptors in extracts from a variety of cells in the 
presence of ligand. 

(5) By the use of transcriptional activation assays, Kamei 
was able to demonstrate that nuclear receptors and CBP interact in a 
30 functional manner. Such transcriptional activation assays can indicate 
that two proteins are involved in a pathway that results in 
transcriptional activation but these assays do not prove that the 
interaction between the proteins is one of direct binding. 

By the above-described methods, Kamei was able to 
demonstrate specific binding interactions between CBP and the retinoic 
acid receptor (RAR), glucocorticoid receptor (GR), thyroid hormone 

-4- 
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receptor (T3R), and retinoid X receptor (KXR). Kamei also demonstrated 
specific binding between the N-tenninus of p300 and RAR. However, 
Kamei did not demonstrate specific binding between CBP, p300, or any 
other nuclear receptor co-activators and PPARs. 
5 What is striking about the methods used by Kamei is their 

extremely laborious and time consuming nature. Such methods 
involve, among other things, the construction of fusion proteins, the 
preparation of 32p-labeled proteins, the construction of specialized 
expression vectors for the yeast two-hybrid assay and the transcriptional 

10 activation assays, the running of many gels, and the raising of 

antibodies. Most of these assays take days to carry out and preparing the 
reagents needed to carry them out may take weeks. Because of the 
complicated reagents that are involved in these assays and the time 
needed to prepare and run the assays, these assays tend to be costly. 

15 Investigators other than Kamei who have studied the interaction 

between nuclear receptors and CBP have also been forced to rely on such 
cumbersome methods (see, e.g. f Chakravarti et al., 1996, Nature 383:99- 
103; Hanstein et al., 1996, Proc. Natl. Acad. Sri. USA 93:11540-11545; 
Heery et al., 1997, Nature 387:733-736). 

20 Kamei did not use the above-described methods to identify 

novel agonists or antagonists of nuclear receptors. The focus of Kamei 
was not on agonists or antagonists, but rather on the interaction 
between nuclear receptors and CBP. Although modifying the methods 
of Kamei to identify agonists or antagonists might be possible, such 

25 methods would suffer from serious disadvantages. This is because, as 
discussed above, all of the assays employed by Kamei to study the 
interaction of CBP and p300 with nuclear receptors are very laborious, 
slow, and costly. Given the therapeutic importance of steroid hormones 
such as estrogen, Cortisol, progesterone, and other nuclear receptor 

30 agonists such as thyroid hormone and antidiabetic thiazolidinedione 
compounds, the need for improved high-throughput screening assays to 
identify potential pharmaceutical compounds affecting nuclear 
receptors is clear. Historically, therapeutically useful nuclear receptor 
ligand compounds were identified by screening finim^l models, an 

35 approach which is even more labor intensive and time consuming than 
the meth ds used by Kamei. Also, approaches such as those used by 

-5- 
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Kamei are ill-suited for the identification of antagonists of nuclear 
receptors. It is now widely appreciated that antagonists of nuclear 
receptors can be valuable therapeutic agents. Examples of such 
therapeutically useful antagonists are tamoxifene, raloxifene, and RU- 
486. 

What is needed is a high throughput, time and labor- 
saving, non-radioactive, inexpensive, and very reliable assay for the 
identification and characterization of both agonists and antagonists of 
nuclear receptors. Such an assay is provided by the present invention. 

SUMMARY OF THE INVENTION 

The present invention provides novel methods of identifying 
agonists and antagonists of nuclear receptors. The methods take 
advantage of the agonist-dependent binding of nuclear receptors and 
CBP, p300, or other nuclear receptor co-activators. In the absence of 
agonist, binding between the nuclear receptor and CBP, p300, or other 
nuclear receptor co-activators does not occur. If agonist is present, 
however, such binding occurs and can be detected by fluorescence 
resonance energy transfer (FRET) between a fluorescentiy-labeled 
nuclear receptor and fluorescently-labeled CBP, p300, or other nuclear 
receptor co-activator. Antagonists can be identified by virtue of their 
ability to prevent or disrupt the agonist-induced interaction of nuclear 
receptors and CBP, p300, or other nuclear receptor co-activators. In 
contrast to prior art methods of identifying agonists and antagonists of 
nuclear receptors, the methods of the present invention, are simple, 
rapid, and less costly. 

The present invention provides a nuclear receptor or ligand 
binding domain thereof labeled with a fluorescent reagent for use in the 
above-described methods of identifying agonists and antagonists of 
nuclear receptors. The present invention also provides CBP, p300, or 
other nuclear receptor co-activator, or a binding portion thereof, labeled 
with a fluorescent reagent. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates a method of fluorescentlv l»fcoir 

Table 1. ^ 2 mUStrateS ^ folmat for -P^nts 1 and 2 of 
Table 1. * mU8trate8 f0Imat f0r 3 of 

10 Tablel. ^ 4m ^ tesAefo ^forexpe rim ent4of 

' x - - Troghtazone; -0- = BRL49653 

(SEQ.ID.NO^r ?A 8h ° WS ^ amiD0 8 ~ o^-an CBP 

Figure 7B shows the nucleotide sequence nf» nriM* 

encc<hnghumanCBP(SEQ.IDNO -2) Th. 1 f A 

20 positions 76-1290. * luJ,0 -». The open reading frame is at 

PPARo (SE^o 8 ^ ^ "~ «»™»» 

en^drnghu^P^^ 
25 positions 217-1623. *** JI,Jra -« The open reading frame is at 

PPARrl (»? m ~e of human 

encoding ~ " I ^ 

30 positions 173-1609. ^ N0..6). The open reading frame is at 

PPAR5 (SEQJDJ^^7X S ^ 0Wa **** ammo aci< * sequence of human 

encoding n^^^ • cDNA 

35 positions 338-1663. ^ NO " 8 >- The open reading frame is at 
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DETAILED DESCRIPTION OP THE INVENTION 
For the purposes of this invention- 

other nuclear r a «ceptor and CBP, p300, or 

°ound to the nudeM **— • ^^scent reagent 

20 n«cIearrecep toco ^*°"^^»« f CBI'.P300,or. ai(!r 

25 ^ rt «^oni,,„ ft henX^ entt0WndM 

- ^ZZ^Z^T-^ ' "<* ^PU., tune 
«- the i^on ^^"^^ -^.e ^ 
antagonists of nuclear h " h " 8Mliste •»* 

W-ding protein is . co-aetiX^' ? m<>ti " ,r nUdMr ""H- 

Bgand binding domain .TZrf " " reTODt ■**« beh '«" 

nucle^receptor LaZ^ T^7 ^ C »-P300, or other 

« "ceptor and CBP, p&K) rothe^ m '* ra caon between the nuclear 
— *»d byobse^La, "^"^^^activatoris 

' ^ 0C ™ effluence resonance energy 
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transfer (FRET) between two fluorescent reagents. One fluorescent 
reagent is bound to the nuclear receptor; the other fluorescent reagent is 
bound to CBP, p300, or other nuclear receptor co-activator. The binding 
of fluorescent reagent to nuclear receptor, CBP, p300, or other nuclear 
5 receptor co-activator can be by a covalent linkage or a non-covalent 
linkage. 

The present invention makes use of fluorescence resonance 
energy transfer (FRET). FRET is a process in which energy is 
transferred from an excited donor fluorescent reagent to an acceptor 

10 fluorescent reagent by means of intermolecular long-range dipole-dipole 
coupling. FRET typically occurs over distances of about lOe to lOOe and 
requires that the emission spectrum of the donor reagent and the 
absorbance spectrum of the acceptor reagent overlap adequately and that 
the quantum yield of the donor and the absorption coefficient of the 

15 acceptor be sufiBcientiy high. In addition, the transition dipoles of the 
donor and acceptor fluorescent reagents must be properly oriented 
relative to one another. For a review of FRET and its applications to 
biological systems, see Clegg, 1995, Current Opinions in Biotechnology 
6:103-110. 

20 The present invention makes use of a nuclear receptor or 

ligand binding domain thereof labeled with a first fluorescent reagent 
and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof, labeled with a second fluorescent reagent. The second 
fluorescent reagent comprises a fluorophore capable of undergoing 

25 energy transfer by either (a) donating excited state energy to the first 
fluorescent reagent, or (b) accepting excited state energy from the first 
fluorescent reagent In other words, according to the present invention, 
either the first or the second fluorescent reagents can be the donor or the 
acceptor during FRET. 

30 The first and second fluorescent reagents are 

spectropscopically complementary to each other. This means that their 
spectral characteristics are such that excited state energy transfer can 
occur between them. FRET is highly sensitive to the distance between 
the first and second fluorescent reagents. For example, FRET varies 

35 inversely with the sixth power of the distance between the first and 
second fluorescent reagents. In the absence of agonist, the first 
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fluorescent reagent, bound to the nuclear receptor or ligand binding 
domain thereof, will not be near the second fluorescent reagent, bound to 
CBP, p300, or other nuclear receptor co-activator, or binding portion 
thereof. Thus, no FRET, or very little FRET, will be observed. In the 
presence of agonist, however, interaction between the nuclear receptor 
and CBP, p300, or other nuclear receptor co-activator will occur, thus 
bringing dose together the first and the second fluorescent reagents, 
allowing FRET to occur and be observed. 

Accordingly, the present invention provides a method of 
identifying an agonist of a nuclear receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; and 

15 (c) a substance suspected of being an agonist of the 

nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and CBP, p300, or other nuclear receptor co- 
activator, or a binding portion thereof, will occur; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that tile substance 
is an agonist of the nuclear receptor. 

In particular embodiments, the nuclear receptor is selected 
from the group consisting of steroid receptors, thyroid hormone 
receptors, retinoic acid receptors, peroxisome proliferator-activated 
receptors, retinoid X receptors, glucocorticoid receptors, vitamin D 
receptors, and "orphan nuclear receptors 9 such as LXR, FXR, etc. 

In a particular embodiment, the nuclear receptor or ligand 
bmding domain thereof is a full-length nuclear receptor. In another 
einbodiment, the nuclear receptor or ligand binding domain thereof is a 
ligand binding domain of a nuclear receptor. In another embodiment, 
the nuclear receptor or ligand binding domain thereof comprises an AF- 
35 2 site of a nuclear receptor. 
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In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof is a full-length PPAR. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is the ligand 
binding domain of a PPAR. In a further embodiment, the PPAR is 
selected from the group consisting of PPARcc, PPARyl, PPARy2, and 
PPAR5. Inafiu^erenmodiment,theKgandDmdingdoniamofthe 
PPAR contains amino acid residues 176-478 of human PPARyl. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof contains amino acids 143-462 of human RARcc. 
In another embodiment, the nuclear receptor or ligand binding domain 
thereof contains amino acids 122-410 of rat T 3 Ral. In another 
embodiment, the nuclear receptor or ligand binding domain thereof 
contains amino acids 227-463 of mouse RXRy. In another embodiment, 
the nuclear receptor or ligand binding domain thereof contains amino' 
15 acids 251-595 of Human ER. 

In a particular embodiment, the above-described methods 
utilize full-length CBP, either mouse or human. In other embodiments, 
the methods utilize amino acid residues 1-113 of human CBP. In 
another embodiment, the methods utilize amino acid residues 1-453 of 
20 human CBP. 

The conditions under which the methods described above 
are carried out are conditions that are typically used in the art for the 
study of protein-protein interactions: e.g., physiological pH; salt 
conditions such as those represented by such commonly used buffers as 
25 PBS; a temperature of about 4°C to about 55°C. The presence of 

commonly used non-ionic detergents, e.g., NP-40®, sarcosyl, Triton X- 
100®, is optional. When europium cryptates are used as fluorescent 
reagents, reactions should contain KFata concentration of at least 200 
mM. 

30 Heeryetal., 1997, Nature 387:733-736 showed that 

interactions between nuclear receptors and a variety of nuclear receptor 
co-activators are mediated by a short amino acid sequence in the nuclear 
receptor co-activators having the amino acid sequence LXXLL, where L 
is leucine and X represents any amino acid. Accordingly, the present 

35 invention can be practiced with a binding portion of a nuclear receptor 
co-activator, provided that the binding portion contains the amino acid 

-11- 
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sequence LXXLL. Therefore, the present invention includes a method of 
identifying an agonist of a nuclear receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 
5 (b) a binding portion of a nuclear receptor co-activator, 

where the binding portion contains the amino add sequence LXXLL, 
and where the binding portion is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
10 nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the binding portion of the nuclear receptor 
co-activator will take place; and 
15 W) measuring fluorescence resonance energy transfer 

(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

In a particular embodiment, the nuclear receptor co- 
20 activator is selected from the group consisting of: human RIP- 140, 

human SRC- 1, mouse TIF-2, human or mouse CBP, human or mouse 
p300, mouse TIF-1, and human TRIP proteins. 

In a particular embodiment, the nuclear receptor co- 
activator is human RIP-140 and the binding portion indudes a 
25 contiguous stretch of amino acids of human RIP-140 selected from the 
group consisting of: positions 20-29, 132-139, 184-192, 266-273, 379-387, 
496-506, 712-719, 818-825, 935-944, and 935-942. 

In another embodiment, the nuclear receptor co-activator is 
human SRC-1 and the binding portion indudes a contiguous stretch of 
30 amino acids of human SRC-1 selected from the group consisting of: 
positions 45-53, 632-640, 689-696, 748-755, and 1434-144L 

In another embodiment, the nuclear receptor co-activator is 
mouse TIF-2 and the binding portion includes a contiguous stretch of 
amino acids of mouse TIF-2 selected from the group consisting of: 
35 positions 640-650, 689-699, and 744-754 

-12- 
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acios or Human or mouse CBP salu4«i <t_ ^ 

5 ^'^^P^nom 68-78 and 356^36* SeIeCted *>»» *• S™P 

5 hunuu,o r nJL^X1^^ nUdeM ^^^- 
of amino S^unl^ P ° rti<,n fadudM ' — *— 

'0 mouse TO-lt^el^^ 

amino adds J^!!?? n **' * ««•— *-* of 

mouse containing positions 722-732 

amino adds of human T^t^*" """"^ * *** * 

15 containing positions 23-33 

human TBIP3 anT^m^l'' *° ~*— * 

amino acids of hun2 ^ " S,retA ° f 

01 numan TRIPS containing positions 97-107 

human «^ Jl^ZS^ Tf" ~»— " 

» amu M addsofhuJm T ^^ OT ^^»-'*^^of 
uman containing positions 36-46 

human m£Z££?Z!! ** -*» ™ -activator is 

amino adds of numt * Streh * rf 

01 numan r«P5 containing positions 26-36 

25 human TMPS^ttrj^^* 6 "^^^^^ « 

amino J^ZZ^?^ ' — — of 

numan TRIPS containinj positions 36-46 

human TBmZ^^T^ ""^ ^activator is 

amino « «~- *-* - 

30 ^^73^,20*266^^8^ ^ * e ^ -"m-Mng of: 

« Yao at a. ^l^ZStTTT " ""W -activators, 

EMBOJ. 14:3741-3751 nap ,J?Z , ' Ca ™ll*» et al, 1995, 

35 <TIF-2> KwJetaT i^f ^ ^ et < ^EMBO J- mol-108 
^*o-«9 (CBP), Eckner et al., 1994, Genes Dev. 8:869-884 
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SSUV* D0Uarin 6t *- 1995 ' EMB0 J ' 14:2020-2033 CHF-l> Lee et al 
1995, Natare 374:91-94 (TRIP proteins). ' 

, , ^P^ 1 ^ embodiments of the present invention 
desmbed above are aU particular embodiments of a more general 

iT^of d 8 ^^-tbod 

3 meth ° d of ^entrfymg an agonist of a nuclear receptor that 
comprises providing: 

labeled with ^ tT*"* ^ ***** domai ° ^reof 

laoeled with a first fluorescent reagent; 

IXYTT * a P° lype P tidecon te^g the amino acid sequence 

^tr po,ypeptide is iabeied * a — fl ~ 

nuclear re ^ & 8 ^ ^ ^ ^ ^ ° f ^ 

^ conditions such that, if the substance is an agonist of 
£e nuclear receptor, binding between the nuclear receptor or ZJ 
binding domain thereof and the polypeptide will take place- and 

(FRET) w + , m ! aSUring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents- 

where the occurrence of FRET indicates that'the substance 
is an agonist of the nuclear receptor. substance 

In a particular embodiment, the amino acid sequence 

18 manah6liCal POrtion of * e Polypeptidelb another 

embodiment, the amino acid sequence LXXLL is present in an „ heh^l 
portion of the polypeptide and the leucines form a hydrophobic face 
The present invention provides methods for identifying 

of the antagonist to prevent the occurrence of agonist-induced binding 

a^r 8 °! 7 reCCPtor CBP ' ^ ° r 0ther receptorl 

activator.ortodisruptsuchbmdmgaftorith^ 

present invention provides a method for identifying antagonists of 

nuclear receptors that comprises providing: 

lflh<a , , a / ud ^ rece P^^Hgandbmdingdoinain thereof 

labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator or a 
landing portion thereof, labeled with a second fluorescent reagen^ 
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(c) an agonist of the nuclear receptor, and 

(d) a substance suspected of being an antagonist of the 
nuclear receptor; 

under conditions such that, in the absence of the substance, 
binding between the nuclear receptor or ligand binding domain thereof 
and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof will occur; and 

(e) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents when the 
substance is present and measuring FRET between the first and second 
fluorescent reagents when the substance is absent; 

where the a decrease in FRET when the substance is 
present indicates that the substance is an antagonist of the nuclear 
receptor. 

In particular embodiments, the nuclear receptor is selected 
from the group consisting of steroid receptors, thyroid hormone 
receptors, retinoic acid receptors, peroxisome proliferator-activated 
receptors, retinoid X receptors, glucocorticoid receptors, vitamin D 
receptors, and "orphan nuclear receptors* such as LXR, FXR, etc. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof is a full-length nuclear receptor. In another 
embodiment, the nuclear receptor or ligand binding domain thereof is a 
ligand binding domain of a nuclear receptor. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is an AF-2 site of 
a nuclear receptor. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof is a full-length PPAR. In another embodiment, 
the nuclear receptor or ligand binding domain thereof is the ligand 
binding domain of a PPAR. In a further embodiment, the PPAR is 
selected from the group consisting of PPARa, PPARy, and PPAR5. In a 
further embodiment, the ligand binding domain of the PPAR contains 
amino acid residues 176-478 of human PPARyl. 

In a particular embodiment, the nuclear receptor or ligand 
binding domain thereof contains amino acids 143-462 of human RARa. 
In another embodiment, the nuclear receptor or ligand binding domain 
thereof contains amino acids 122-410 of rat T Jbal. In another 
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embodiment, the nuclear receptor or ligand binding domain thereof 
contains amino acids 227-463 of mouse RXRy. In another embodiment, 
the nuclear receptor or ligand binding domain thereof contains amino 
acids 251-595 of human ER. 

In a particular embodiment, the above-described methods 
utilize full-length CBP, either mouse or human. In other embodiments, 
the methods utilize amino acid residues 1-113 of human CBP. In 
another embodiment, the methods utilize amino acid residues 1-453 of 
human CBP.. 

The conditions under which the methods described above 
are carried out are conditions that are typically used in the art for the 
study of protein-protein interactions: e.g. , physiological pH; salt 
conditions such as those represented by such commonly used buffers as 
PBS; a temperature of about 4°C to about 55°C. The presence of 
15 commonly used non-ionic detergents, e.g. , NP-40® , sarcosyl, Triton X- 
100®, is optional. When europium cryptates are used as fluorescent 
reagents, reactions should contain KF at a concentration of at least 200 
mM. 

In principle, one could measure FRET by monitoring either 
(a) a decrease in the emission of the donor fluorescent reagent following 
stimulation at the donor's absorption wavelength and/or (b) an increase 
in the emission of the acceptor reagent following stimulation at the 
donor's absorption wavelength. In practice, FRET is most effectively 
measured by emission ratioing. Emission ratioing monitors the change 
25 in the ratio of emission by the acceptor over emission by the donor. An 
increase in this ratio signifies that energy is being transferred from 
donor to acceptor and thus that FRET is occurring. Emission ratioing 
can be measured by employing a laser-scanning confocal microscope. 
Emission ratioing is preferably done by splitting the emitted light from a 
sample with a dichroic mirror and measuring two wavelength bands 
(corresponding to the donor and the acceptor emission wavelengths) 
simultaneously with two detectors. Alternatively, the emitted fight can 
be sampled consecutively at each wavelength (by using appropriate 
filters) with a single detector. In any case, these and other methods f 
35 measuring FRET are well known in the art. 
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The extent of flurorescent resonance nergy transfer is measured as the 
ratio between these two values. Typically this ratio is multiplied by 
10,000 to give whole numbers. 

Other FRET donor-acceptor pairs are suitable for the 

5 practice of the present invention. For example, the following donor- 
acceptor pairs can be used: dansyl/fluorescein; fluorescein/rhodamine; 
tryptophan/aminocoumarin. 

The present invention provides a nuclear receptor or ligand 
binding domain thereof labeled with a fluorescent reagent for use in the 

10 above-described methods of identifying agonists and antagonists of 
nuclear receptors. The present invention also provides CBP, p300, or 
other nuclear receptor co-activator, or a binding portion thereof, labeled 
with a fluorescent reagent. 

In a particular embodiment, the nuclear receptor or ligand 

15 binding domain thereof is selected from the group consisting of PPARa, 
PPARy, PPAR5, a ligand binding domain of PPARa, PPARy, or PPAR5, 
and amino acid residues 176-478 of human PPARyl and the fluorescent 
reagent is selected from the group consisting of XL665 and Europium 
cryptate (Eu3+K). 

20 In a particular embodiment, CBP, p300, or other nuclear 

receptor co-activator is labeled with a fluorescent reagent selected from 
the group consisting of XL665 and Europium cryptate (Eu3+K). 

The following non-limiting examples are presented to better 
25 illustrate the invention. 

EXAMPLE 1 

rapping egression, and purification of human CBP and PPAR 

Proteus 

30 To test whether human CBP can interact with PPARs in an 

agonist-dependent maimer, we cloned the human cDNA fragments 
encoding the NH2-terminal 1-113 amino acids (hCBPl-113) and 1-453 
amino acids (hCBPl-453) of human CBP by the polymerase chain 
reaction (PCR). The DNA and amino acid sequences of human CBP are 
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disclosed in Borrow et al., 1996, Nature Genet 14:33-41 and in GenBank 
accession no. U47741. 

The primers used for hCBPl-113 were: 

5'-ACTCGGATCCAAGCCATGGCTGAGAACTTGCTGGACGG-3' 
5 (SEQ.ID.NO.:9)and 

5j-CACAAAGCTTAGGCCATGTTAGCACTGTTCGG-3' (SEQ.ID.NO.: 

These primers were expected to amplify a 0.9 kb DNA fragment. 
The primers for hCBPl-453 were- 

5'-ACTCGGATCCAAGCCATGGCTGAGAACTTGCTGGACGG-3' 
(SEQ.ID.NO.:9) and 

S'CTCAGTCGACTTATTGAATTCCACTAGCTGGAGATCC-S' 
(SEQ.ID.NO.:ll). 

These primers were expected to amplify a 1.5 kb DNA fragment. 
™ A _ The template for *• PCR reaction was a human fetal brain 
hbrarv (Sterne, Catalogue #IS 937227). Of course any 
human cDNA Hbrary from a tissue expressing CBP could have been 
used. The PCR amplified 0.9 kb and 1.6 kp DNA fragments which were 
digested with restriction endonucleases and ligated into pBluescript II 
vector. DNA sequencing analysis confirmed that the amplified 
fragments were identical to the corresponding published nucleic acid 
sequences of human CBP. 

Based on the publicly available sequences for human CBP 
ated above, other primers could be readily identified and prepared by 
those skilled in the art in order to amplify and clone other portions of 
cDNA encoding human CBP from appropriate cDNA libraries Once 
such portions of human CBP are produced, they could be used in the 
methods of the present invention in a manner similar to that described 
herein for hCBPl-113 and hCBPl-453. The amino acid sequence of 
human CBP is shown in Figure 7A; the nucleic acid sequence of the 
cDNA encoding human CBP is shown in Figure 7B. 

To express the polypeptides encoded by the PCR fragments 
vectors encoding fusion proteins of the polypeptides and glutathione S- ' 
toansferase (GST) were constructed and expressed in E. coli. The PCR 
fragments were subdoned into the expression vector pGEX (Pharmacia 
Biotech) to generate pGEXhCBPl-113 and pGEXhCBPl-453. 
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pGEXhCBPl-113 and pGEXhCBPl-453 were transfected into the DH5a 
strain of E. coli (GIBCO BRL) and the bacteria hosting either 
pGEXhCBPl-113 or pGEXhCBPl-453 were cultured in LB medium 
(GIBCO BRL) to a density of OD600 = 0.7-1.0 and induced for 

5 overexpression of the GST-CBP fusion proteins by addition of IPTG 
(isopropylthio-^-galactoside) to a final concentration of 0.2 mM, The 
IPTG induced cultures were further grown at room temperature for 2-5 
hrs. The cells were harvested by centrifugation for 10 min at 5000g. The 
cell pellet was used for GST-CBP fusion protein purification by following 

10 the procedure from Pharmacia Biotech using Glutathione Sepharose 
beads. hCBPl-113 and hCBPl-453 proteins were generated by cleaving 
the corresponding GST fusion proteins with thrombin. SDS- 
polyacrylamide gel electrophoresis analysis showed that the preparation 
from pGEXhCBPl-113 gave two polypeptide bands, with apparent 

IS molecular weight of 12 kd and 10 kd. The 12 kd band is the expected size 
of hCBPl-113 and the 10 kd band is most likely a premature translational 
termination product. The preparation from pGEXhCBPl-450 gave a 
single band with the expected size, 50 kd. 

cDNAs encoding full-length PPARa and PPARyl were 

20 subdoned into pGEX vectors for the production of GST-PPARa and GST- 
PPARyl fusion proteins in E.colL PPARyl was cloned from a human fat 
cell cDNA library (see Elbrecht et aL, 1996, Biochem. Biophys. Res. 
Comm. 224:431-437). A cDNA encoding the human PPARyl ligand 
binding domain (PPARylLBD; amino acids 176-478 of PPARyl) was 

25 subcloned from a modified pSG5 vector as a Xho I (site located in the N- 
terminus of the LBDV Xba I (site located in the pSG5 vector) fragment 
The Xba I site was blunt-ended with T4 DNA polymerase. The 1.1 kb 
fragment containing the LBD was purified from an agarose gel and 
ligated into pGEX-KG (see Guan & Dixon, 1991, Anal. Biochem. 192:262- 

30 267) that had been digested with Xho I and Hind III (the Hind III site 
had been blunt-ended with T4 DNA polymerase). This construct was 
used for the production of GST-hPPARylLBD and hPPARylLBD (the 
ligand binding domain cleaved free of GST). The overexpression and 
purification of PPARa, PPARyl, and PPARylLBD were as described 

35 above for CBP. 
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The DNA and amino acid sequences of human PPARa are 
disclosed in Schmidt et al., 1992, Mol. Endocrinol. 6:1634-1641 and in 
GenBank, accession no. L07592. See Figure 8A and 8B. 

The DNA and amino acid sequences of human PPARyl are 

^ T\ «^ ^ *" 1995 ' GMle ^ 4:281 " 2 "; <* et ^ MB5, Mol. 
Cell. B«,l. 15:1817-1825; Elbrecht et al., 1996, Biochem. Biophys Res 

STi ^t 1 " 4 ^ "** ^ aCCessio » "0804. See Figure 

9A and 9B. Human PPARy2 contains the same amino acid sequent 
human PPARyl except for an amino terminal addition of 24 amino acids 
(see Elbrecht et al., 1996, Biochem. Biophys. Res. Comm. 224-431-437) 
Thus the amino acid sequence of the ligand binding domain of human 
JPPARy2 is the same as the amino acid sequence of the ligand binding 
do^ of human PPARyl, although the numbering of A. amino adds 
differs (176-478 for human PPARyl and 200-502 for human PPARr2). 

The DNA and amino acid sequences of human PPAR8 are 
msdosed in Sher et al., 1993, Biochemistry 32:5598-5604 and in GenBank. 
accession no. L02932. See Figure 10A-C. 

EXAMPLE 2 

20 interaction hot™*™ PPAift , ^ d hfTRP ^ f „ 

Experiments were first conducted using hCBPl-113 and 
hPPARylLBD. Purified hPPARylLBD was biotinyla^ed with S^HS- 
LC-Bxotm (PIERCE) to a biotin:hPPARylLBD ratio of 3:1 according to the 
procedure provided by PIERCE. Purified hCBPl-113 was directly labeled 

w « T^TuT^* <EU3+K) by mettod m ^trated in Figure 1 
Biotm-labeledhPPARylLBD.EuS+K-labeledhCBPl-llS and 
^ptavidin-labeled XL665 (SA-XL665; from PACKARD*) were incubated 
together in the presence or absence of 1 uM of known PPARy agonist 
(BRL49653 or AD5075). 

Thus, this experimental format made use of the fluorescent 
reagent pair europium cryptate (EuS+K), which acted as donor, and 
XL665, which acted as acceptor. hCBPl-113 was directly labeled with 
europium cryptate (Eu3 + K); hPPARylLBD was indirectly labeled with 
XL665 by means of a biotin-streptavidin link. The emission maximum 
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of europium cryptate (Eu3 + K) overlaps with the absorption maximum of 
XL665. Therefore, when europium cryptate (Eu3+K) and XL665 are in 
dose proximity, and the sample is fflumhiated with light at 337 nm (the 
absorption maximum of europium cryptate (Eu3+K)), FRET can occur 
between europium cryptate (Eu3+K) and XL665. This FRET manifests 
itself as increased emission at 665 nm by XL665. Figure 2 shows a 
schematic of the format used in this experiment (experiment 1 of Table 
1). When agonist is bound to hPPARylLBD, a specific interaction occurs 
between hPPARylLBD and hCBPl-113, thus bringing europium cryptate 
(Eu3+K) and XL665 into close enough proximity for FRET to occur In 
the absence of agonist, no interaction occurs between hPPARylLBD and 
hCBPl-113 and thus europium cryptate (Eu3+K) and XL665 are not 
brought into close proximity and no FRET occurs. When FRET occurs 
the amount of light given off by the sample at the emission maximum of 
XL665 (665 nm) is increased relative to the amount of light given off by 
the sample at the emission maximum of europium cryptate (Eu3+K) 
(620 nm). Therefore, measuring the ratio of emission at 665 nm to 620 
xim in the presence and the absence of a substance suspected of being an 
agomst allows for the determination of whether that substance actually 
is an agonist If the substance is an agonist, an increase in the ratio of 
emission at 665 nm to 620 nm in the presence of the substance will be 
observed. 

Reactions were carried out in microtiter plates. Reaction 
conditions were: appropriate volume (total 250 ul) of the reaction buffer 
(either PBS or HEPES, see below, containing 500 mM KF, 0.1% bovine 
serum albumin, BSA) was added to each well, followed by addition of 
hgands (BRL49653 or AD5075 at a final concentration of 1 uM and 0.1% 
dimethylsulfoxide (DMSO) or vehicle control (0.1% DMSO) Eu3+K 
labeled hCBP (100 nM), biotin-hPPARylLBD (100 nM), and streptavidin- 
labeled XL665 (100 nM) to appropriate wells. After mixing, 200ulof 
reaction mixture was transferred to a new well. The plate was either 
directly measured for fluorescence resonance energy transfer (FRET) or 
covered with sealing tape (PACKARD) to avoid evaporation and 
incubated at room temperature for up to 24 hrs before measuring FRET. 

The results of this experiment and others described below 
yielded ratio values as follows: 
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Table 1 







^mission ratio 
with AD5075 


Emission ratio 
with vehicle 


1 


PBS 


1134 


1074 


2 


HEPES + 0.05% 
NP40 


967 


617 


3 


HEPES + 0.05% 
NP40 


1078 


536 


4 


HEPES + 0.05% 
CHAPS 


1883 


487 
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Experiment 1 of Table 1 was carried out using PBS (137 mM 
NaCl, 2.7 mM KC1, 4.3 mM Na2HP04, 1.4 mM KH2PO4, pH 7.4). The 
greater emission ratio observed in the presence of AD5075 demonstrated 
that a specific interaction between hCBPl-113 and hPPARylLBD 
5 occurred in the presence of the agonist AD5075. Although it was clear 
that FRET was occurring, the signal-noise ratio was small. In 
experiment 2 of Table 1, HEPES buffer (N-2-hydro3yethylpipera2ine-N , - 
2-ethane sulfonic acid, 100 mM, pH 7.0) containing 0.05% NP40 (Nonidet 
P-40) was used instead of PBS and an improved signal-noise ratio was 
10 obtained. 

In order to get an even better signal-noise ratio, the above- 
described format was modified slightly for experiment 3. In experiment 
3, SA-XL665 (500 nM), biotin-labeled hPPARylLBD (100 nM), GST- 
hCBPl-113, and Eu3+K labeled anti-GST antibody (2.5 were incubated 

15 in the presence or absence of AD5075 (1 pM) in HEPES buffer containing 
0.05% NP40. A two-fold signal- noise ratio was obtained. Figure 3 shows 
a schematic of the format used in experiment 3. 

The anti-GST antibody was a goat antibody to GST from 
Pharmacia (catalogue number 27-4577-01) that was labeled with Eu3+K 

20 according to the procedure summarized below. 

- Make up @ 10 
mg/mL in H20. 
Need 4Z2pg (4.2 
pL, 96.6 nmol) for 
49.0 pg Eu3+ 
Reagent 




-Resuspended@ ' -FW=1465 2.9 Equiv SULFO-SMCC, 

2.5 mg/mL in 10% Use 49.0 pg 20 mM Pi buffer, 10% DMF 

DMF/PBS (19.6 pL, 33.4 nmol) RT, 30 minutes 
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5 equiv. Eu3+ 
complex 



Anti-GST Antibody, . 
Cat #27-4577-01 



J H 




1) Add Eu reagent 

2) [Pr] 4.2 mg/mL. 
Onightat4°C 

Lower pH: Add 12 pL of 
lMNaPi,pH7.0. 
pH drops to 7.18. 

350 pM TCEP 

(35 mM stock is 

10.0 mg/mL- PBS, pH 

7.0), 2.4 pL, 

15 min rt then 15 min @ 

4°C 



Anti-GST Antibody, 
Cat #27-45774)1 



From Pharmacia, 5,0 mg/mL, 
FW = 150 kD Use 200 pL (1 mg, 
6.66 nmol) exchange into 10 mM 
Borate, 350 mM Nad, 10% Gly, 
pH8.5 with BioSpin-30 



5.0 Equiv SPDP, 
RT, 5 hours 



FW =312, Dissolve 
@ 1.00 mg/mL in EtOH. 
Add 10.4 pL (5 equiv., 
10.4 pg, 33.4 nmol) to 
protein. 



To further improve the signal to noise ratio, a series of 
experiments were conducted. Experiment 4 of Table 1 exemplifies 

5 results obtained from those efforts. cDNA encoding a longer fragment of 
hCBP was doned and expressed to get hCBPl-453. hCBPl-453 was 
biotinylated. Biotm-labeled hCBPl-453 (25 nM), SA-XL665 (100 nM), 
GST-hPPARylLBD (1 nM), and Eu3+K-labeled anti-GST antibody (2 nM) 
were mixed together in the presence or absence of 1 |iM AD5075. The 

10 detergent was changed from 0.05% NP40 to 0.5% CHAPS (3-{[3- 

cholamidopropyl]dimethyl-ammomol}-l-propanesulfonate). A three- to 
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four-fold signal-noise ratio was obtained. Figure 4 shows the strategy 
used for experiment 4 and similar experiments. 

The correlation between results from the above-described 
assays and previously reported results from in vitro binding and 
5 transcriptional activation assays of selected antidiabetic insulin 

sensitizers that are known to be PPARy agonists (Elbrecht et al , 1996 
Biochem Biophys Res Comm 224:431-437) was analyzed by titrating those 
known PPARy agonists in the assays described above and comparing 
EC50s so obtained with previously described values for potency in 
10 binding or transcriptional activation assays for the known agonists The 
results are shown in Figure 5. From Figure 5, the following EC 50 s can 
be derived: 

AD5075 = 8 nM 
BRL49653 = 53 nM 
15 Troglitazone = 646 nM 

Pioglitazone = 890 nM. 
These EC50s generated in the above-described assays are in close 
agreement with those generated by in vitro binding and transcriptional 
activation studies (Elbrecht et al., 1996, Biochem Biophys Res Comm 
20 224:431-437). 

The above-described assay can also be used to characterize 
the interaction between nuclear receptors with co-activators as, e.g by 
determining the binding constant for that interaction. Figure 6 shows 
an example of such an application. Saturating amounts of PPARy 
agonist (10 uM BRL49653) were used. Increasing concentrations of non- 
biofanylated hCBPl-453 were used to titrate away biotin-hCBP- 
PPARylLBD complex and decrease the fluorescence energy transfer. A 
Kd of 300 nM for the interaction between hCBPl-453 and PPARylLBD 
can be derived from the results illustrated in Figure 6 and this Kd (300 
nM) is a measurement ofthe affinity between CBP and PPARy. 

The present invention is not to be limited in scope by the 
specific embodiments described herein. Indeed, various modifications 
of the invention in addition to those described herein will become 
apparent to those skilled in the art from the foregoing description. Such 
35 modifications are intended to fall within the scope ofthe appended 
claims. 
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WHAT IS CLAIMED: 

1. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
5 labeled with a first fluorescent reagent; 

(b) CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; and 

(c) a substance suspected of being an agonist of the 
nuclear receptor; 

10 under conditions such that, if the substance is an agonist of 

the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and CBP, p300, or other nuclear receptor co- 
activator, or a binding portion thereof, will occur, and 

(d) measuring fluorescence resonance energy transfer 
15 (FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

2. The method of claim 1 where the nuclear receptor or 
20 ligand binding domain thereof is selected from the group consisting of 

steroid receptors, thyroid hormone receptors, retinoic acid receptors, 
peroxisome proliferator-activated receptors, retinoid X receptors, 
glucocorticoid receptors, vitamin D receptors, LXR, and FXR. 

25 3. The method of claim 1 where the nuclear receptor or 

ligand binding domain thereof is selected from the group consisting of a 
full-length nuclear receptor, a ligand binding domain of a nuclear 
receptor, and an AF-2 site of a nuclear receptor. 

30 4. The method of claim 1 where the nuclear receptor or 

ligand binding domain thereof comprises an AF-2 site of a nuclear 
receptor. 

5. The method of claim 1 where the nuclear receptor or 
35 ligand binding domain thereof is selected from the group consisting of a 
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full-length PPAR, a ligand binding domain of a PPAR, and amino acid 
residues 176-478 of human PPARyl. 

6. The method of claim 1 where the nuclear receptor or 
5 ligand binding domain thereof is selected from the group consisting of 

PPARa, PPARyl, PPARy2, and PPAR8. 

7. The method of claim 1 where the nuclear receptor or 
ligand binding domain thereof comprises a ligand binding domain 

10 selected from the group consisting of amino acids 143-462 of human 
RARa, amino acids 122-410 of rat T3Ral, amino acids 227-463 of mouse 
RXRy, and amino acids 251-595 of human ER. 

8. The method of claim 1 where CBP,p300, or other 
15 nuclear receptor co-activator, or a binding portion thereof is selected 

from the group consisting of full-length human CBP, full-length mouse 
CBP, amino acid residues 1-113 of human CBP, and amino acid residues 
1-453 of human CBP. 

20 9. The method of claim 1 where the first fluorescent 

reagent is selected from the group consisting of XL665 and Europium 
cryptate (Eu3+KX 

10* The method of claim 1 where the second fluorescent 
25 reagent is selected from the group consisting of XL665 and Europium 
cryptate (Eu3+K). 

11. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 
30 (a) a nuclear receptor or ligand binding domain thereof 

labeled with a first fluorescent reagent; 

(b) a binding portion of a nuclear receptor co-activator, 
where the binding portion contains the amino acid sequence LXXLL, 
and where the binding portion is labeled with a second fluorescent 
35 reagent; and 
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(c) a substance suspected of being an agonist of the 
nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
5 binding domain thereof and the binding portion of the nuclear receptor 
co-activator will take place; and 

(d) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
10 is an agonist of the nuclear receptor. 

12. The method of claim 11 where the binding portion of a 
nuclear receptor co-activator is selected from the group consisting of 
human RIP- 140, human SRC-1, mouse TIF-2, human or mouse CBP, 

15 human or mouse p300, mouse TDM, and human TRIP proteins. 

13. A method of identifying an agonist of a nuclear 
receptor that comprises providing: 

(a) a nuclear receptor or ligand binding domain thereof 
20 labeled with a first fluorescent reagent; 

(b) a polypeptide containing the amino add sequence 
LXXLL where the polypeptide is labeled with a second fluorescent 
reagent; and 

(c) a substance suspected of being an agonist of the 
25 nuclear receptor; 

under conditions such that, if the substance is an agonist of 
the nuclear receptor, binding between the nuclear receptor or ligand 
binding domain thereof and the polypeptide will take place; and 

(d) measuring fluorescent resonance energy transfer 
(FRET) between the first and second fluorescent reagents; 

where the occurrence of FRET indicates that the substance 
is an agonist of the nuclear receptor. 

14. A method for identifying an antagonist of a nuclear 
35 receptor that comprises providing: 
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(a) a nuclear receptor or ligand binding domain thereof 
labeled with a first fluorescent reagent; 

(b) GBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a second fluorescent reagent; 

(c) an agonist of the nuclear receptor; and 

(d) a substance suspected of being an antagonist of the 
nuclear receptor, 

under conditions such that, in the absence of the substance, 
binding between the nuclear receptor or ligand binding domain thereof 
and CBP, p300, or other nuclear receptor co-activator, or a binding 
portion thereof will occur; and 

(e) measuring fluorescence resonance energy transfer 
(FRET) between the first and second fluorescent reagents when the 
substance is present and measuring FRET between the first and second 
fluorescent reagents when the substance is absent; 

where the a decrease in FRET when the substance is 
present indicates that the substance is an antagonist of the nuclear 
receptor. 

15. The method of claim 14 where the nuclear receptor or 
ligand binding domain thereof is selected from the group consisting of 
steroid receptors, thyroid hormone receptors, retinoic acid receptors, 
peroxisome proliferator-activated receptors, retinoid X receptors, 
glucocorticoid receptors, vitamin D receptors, LXR, and FXR. 

16. The method of claim 14 where the nuclear receptor or 
ligand binding domain thereof is selected from the group consisting of a 
full-length nuclear receptor, a ligand binding domain of a nuclear 
receptor, and an AF-2 site of a nuclear receptor. 

17. The method of claim 14 where the nuclear receptor or 
ligand binding domain thereof comprises an AF-2 site of a nuclear 
receptor. 

18. The method of claim 14 wh re th nuclear receptor or 
ligand binding domain thereof is selected from the group consisting of a 
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full-length PPAR, a ligand binding domain of a PPAR, and amino acid 
residues 176-478 of human PPARyL 

19. The method of claim 14 where the nuclear receptor or 
5 ligand binding domain thereof is selected from the group consisting of 
PPARa, PPARyl, PPARy2, and PPARS. 



20. The method of claim 14 where the nuclear receptor or 
ligand binding domain thereof comprises a ligand binding domain 
10 selected from the group consisting of amino acids 143-462 of human 
RARo, amino acids 122-410 of rat T3Ral, amino acids 227-463 of mouse 
RXRy, and amino acids 251-595 of human ER. 



21. The method of claim 14 where CBP, p300, or other 
15 nuclear receptor co-activator, or a binding portion thereof is selected 

from the group consisting of full-length CBP, amino acid residues 1-113 
of human CBP, and amino acid residues 1-453 of human CBP. 

22. The method of claim 14 where the first fluorescent 
20 reagent is selected from the group consisting of XL665 and Europium 

cryptate (Eu3+K). 



23. The method of claim 14 where the second fluorescent 
reagent is selected from the group consisting of XL665 and Europium 

25 cryptate (Eu3+K). 

24. A nuclear receptor or ligand binding domain thereof 
labeled with a fluorescent reagent. 

30 25. The nuclear receptor or ligand binding domain 

thereof of claim 24 where the nuclear receptor or ligand binding domain 
thereof is selected from the group consisting of PPARa, PPARyl, 
PPARy2, PPAR5, a ligand binding domain of PPARa, PPARyl, PPARy2, 
or PPAR5, and amino acid residues 176-478 of human PPARyl and the 

35 fluorescent reagent is selected from the group consisting of XL665 and 
Europium cryptate (Eu3+K). 
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26. CBP, p300, or other nuclear receptor co-activator, or a 
binding portion thereof, labeled with a fluorescent reagent. 

27. The CBP, p300, or other nuclear receptor co-activator, 
or a binding portion thereof, of claim 26 where the fluorescent reagent is 
selected from the group consisting of XL665 and Europium cryptate 
(Eu3+K). 
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1 MAENLLDGPPNPKRAKLSSPGFSANDSTDFGSLFDLENDLPDELIPNGGELGLLNSGNLV 
61 PDAASKHKQLSELLRGGSGSSINPG1GNVSASSPVQQGLGGQAQGQPNSANMASLSAMGK 
121 SPLSQGDSSAPSLPKQAASTSGPTPAASQALNPQAQKQVGLATSSPATSQTGPGICMNAN 
181 FNQTHPGLLNSNSGHSL I NQASQGQAQVMNGSLGAAGRGRGAGMPYPTPAMQGASSSVLA , 
241 ETLTQVSPQMTGHAGLNTAQAGGMAKMGITGNTSPFGQPFSQAGGQPMGATGVNPQLASK 

301 qsmvnslptfptdikntsvtnvpnmsqmqtsvgivptqaiatgptadpekrkliqqqlvl 
361 llhahkcqrreqangevracslphcrtmknvLnhmthcqagkacq 

FIG.7A 



1 cgagccccga cccccgtccg ggccctcgcc ggccgcgccg cccgtgcccg gggctgtttt 
61 cccgagcagg tgaaaatggc tgagaacttg ctggacggac cgcccaaccc caaaagagcc 
121 aaactcagct cgcccggttt ctcggcgaat gacagcacag attttggatc attgtttgac 
181 ttggaaaatg atcttcctga tgagctgata cccaatggag gagaattagg ccttttaaac 
241 agtgggaacc ttgttccaga tgctgcttcc aaacataaac aactgtcgga gcttctacga 
301 ggaggcagcg gctctagtat caacccagga ataggaaatg tgagcgccag cagccccgtg 
361 cagcagggcc tgggtggcca ggctcaaggg cagccgaaca gtgctaacat ggccagcctc 
421 agtgccatgg gcaagagccc tctgagccag ggagattctt cagcccccag cctgcctaaa 
481 caggcagcca gcacctctgg gcccaccccc gctgcctccc aagcactgaa tccgcaagca 
541 caaaagcaag tggggctggc gactagcagc cctgccacgt cacagactgg acctggtatc 
601 tgcatgaatg ctaactttaa ccagacccac ccaggcctcc tcaatagtaa ctctggccat 
661 agcttaatta atcaggcttc acaagggcag gcgcaagtca tgaatggatc tcttggggct 
721 gctggcagag gaaggggagc tggaatgccg taccctactc cagccatgca gggcgcctcg 
781 agcagcgtgc tggctgagac cctaacgcag gtttccccgc aaatgactgg tcacgcggga 
841 ctgaacaccg cacaggcagg aggcatggcc aagatgggaa taactgggaa cacaagtcca 
901 tttggacagc cctttagtca agctggaggg cagccaatgg gagccactgg agtgaacccc 
961 cagttagcca gcaaacagag catggtcaac agtttgccca ccttccctac agatatcaag 
1021 aatacttcag tcaccaacgt gccaaatatg tctcagatgc aaacatcagt gggaattgta 
1081 cccacacaag caattgcaac aggccccact gcagatcctg aaaaacgcaa actgatacag 
1141 cagcagctgg ttctactgct tcatgctcat aagtgtcaga gacgagagca agcaaacgga 
1201 gaggttcggg cctgctcgct cccgcattgt cgaaccatga aaaacgtttt gaatcacatg 
1261 acgcattgtc aggctgggaa agcctgccaa 

FIG.7B 
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1 MVDTESPLCPLSPLEAGDLESPLSEEFLQEMGNIQEISQSIGEDSSGSFGFTEYQYLGSC 
61 PGSDGSVITDTLSPASSPSSVTYPVVPGSVDESPSGALNIECRICGDKASGYHYGVHACE 
121 GCKGFFRRTIRLKLVYDKCDRSCKIQKKNRNKCQYCRFHKCLSVGMSHNAIRFGRMPRSE 
181 KAKLKAE I LTCEHD I EDSETADLKSLAKRI YEAYLKNFNMNKVKARV I LSGKASNMPPFV 
241 I HDMETLCMAEKTLVAKLVANGIQNKEVEVRI FHCCQCTSVETVTELTEFAKAI PAFANL 
301 DLNDQVTLLKYGVYEAIFAMLSSVMNKDGMLVAYGNGFITREFLKSURKPFCDIMEPKFD 
361 FAMKFNALELDDSD I SLFVAAI I CCGDRPGLLNVGH I EKMQEG I VHVURLHLQSNHPDD I 
421 FLPKLLQKMADLRQLVTEHAQLVQI IKKTESDAALHPLLQEI YRDMY 

FIG.8A 

1 ggcccaggct gaagctcagg gccctgtctg ctctgtggac tcaacagttt gtggcaagac 
61 aagctcagaa ctgagaagct gtcaccacag ttctggaggc tgggaagttc aagatcaaag 
121 tgccagcaga ttcagtgtca tgtgaggacg tgcttcctgc ttcatagata agagtagctt 
181 ggagctcggc ggcacaacca gcaccatctg gtcgcgatgg tggacacgga aagcccactc 
241 tgccccctct ccccactcga ggccggcgat ctagagagcc cgttatctga agagttcctg 
301 caagaaatgg gaaacatcca agagatttcg caatccatcg gcgaggatag ttctggaagc 
361 tttggcttta cggaatacca gtatttagga agctgtcctg gctcagatgg ctcggtcatc 
421 acggacacgc tttcaccagc ttcgagcccc tcctcggtga cttatcctgt ggtccccggc 
481 agcgtggacg agtctcccag tggagcattg aacatcgaat gtagaatctg cggggacaag 
541 gcctcaggct atcattacgg agtccacgcg tgtgaaggct gcaagggctt ctttcggcga 
601 acgattcgac tcaagctggt gtatgacaag tgcgaccgca gctgcaagat ccagaaaaag 
661 aacagaaaca aatgccagta ttgtcgattt cacaagtgcc tttctgtcgg gatgtcacac 
721 aacgcgattc gttttggacg aatgccaaga tctgagaaag caaaactgaa agcagaaatt 
781 cttacctgtg aacatgacat agaagattct gaaactgcag atctcaaatc tctggccaag 
841 agaatctacg aggcctactt gaagaacttc aacatgaaca aggtcaaagc ccgggtcatc 
901 ctctcaggaa aggccagtaa caatccacct tttgtcatac atgatatgga gacactgtgt 
961 atggctgaga agacgctggt ggccaagctg gtggccaatg gcatccagaa caaggaggtg 
1021 gaggtccgca tctttcactg ctgccagtgc acgtcagtgg agaccgtcac ggagctcacg 
1081 gaattcgcca aggccatccc agcgttcgca aacttggacc tgaacgatca agtgacattg 
1141 ctaaaatacg gagtttatga ggccatattc gccatgctgt cttctgtgat gaacaaagac 
1201 gggatgctgg tagcgtatgg aaatgggttt ataactcgtg aattcctaaa aagcctaagg 
1261 aaaccgttct gtgatatcat ggaacccaag tttgattttg ccatgaagtt caatgcactg 
1321 gaactggatg acagtgatat ctcccttttt gtggctgcta tcatttgctg tggagatcgt 
1381 cctggccttc taaacgtagg acacattgaa aaaatgcagg agggtattgt acatgtgctc 
1441 agactccacc tgcagagcaa ccacccggac gatatctttc tcttcccaaa acttcttcaa 
1501 aaaatggcag acctccggca gctggtgacg gagcatgcgc agctggtgca gatcatcaag 
1561 aagacggagt cggatgctgc gctgcacccg ctactgcagg agatctacag ggacatgtac 
1621 tgagttcctt cagatcagcc acaccttttc caggagttct gaagctgaca gcactacaaa 
1681 ggagacgggg gagcagcacg attttgcaca aatatccacc actttaacct tagagcttgg 
1741 acagtctgag ctgtaggtaa ccggcatatt attccatatc tttgttttaa ccagtacttc 
1801 taagagcata gaactcaaat gctgggggag gtggctaatc tcaggactgg gaag 

FIG.8B 
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1 MTMVDTEIAFWPTNF6ISSVDLSVMEDHSHSFDIKPFTTVDFSSISTPHYEDIPFTRTDP 
61 VVADYKYDLKLQEYQSAIKVEPASPPYYSEKTQLYNKPHEEPSNSLMAIECRVCGDKASG 
121 FHYGVHACE6CKGFFRRTIRLKLIYDRCDLNCRIHKKSRNKCQYCRFQKCLAVGMSHNAI 
181 RFGRIAQAEKEKLLAEISSDIDQLNPESADLRQALAKHLYDSYIKSFPLTKAKARAILTG 
241 KTTDKSPFV I YDMNSLMMGEDKI KFKHITPLQEQSKEVAI RI FQGCQFRSVEAVQEITEY 
301 AKSIPGFVNLDLNDQVTLLKYGVHEIIYTMLASLMNKDGVLISEGQGFMTREFLKSLRKP 
361 FGDFMEPKFEFAVKFNALELDDSDLAIFIAVI ILSGDRPGLLNVKPIEDIQDNLLQALEL 
421 QLKLNHPESSQUAKLLQKMTDLRQIVTEHVQLLQVIKKTETDMSLHPLLQEIYKDLY 

FIG.9A 



1 ccgaccttac cccaggcggc cttgacgttg gtcttgtcgg caggagacag caccatggtg 

61 ggttctctct gagtctggga attcccgagc ccgagccgca gccgccgcct ggggggcttg 

121 ggtcggcctc gaggacaccg gagaggggcg ccacgccgcc gtggccgcag aaatgaccat 

181 ggttgacaca gagatcgcat tctggcccac caactttggg atcagctccg tggatctctc 

241 cgtaatggaa gaccactccc actcctttga tatcaagccc ttcactactg ttgacttctc 

301 cagcatttct actccacatt acgaagacat tccattcaca agaacagatc cagtggttgc 

361 agattacaag tatgacctga aacttcaaga gtaccaaagt gcaatcaaag tggagcctgc 

421 atctccaect tattattctg agaagactca gctctacaat aagcctcatg aagagccttc 

481 caactccctc atggcaattg aatgtcgtgt ctgtggagat aaagcttctg gatttcacta 

541 tggagttcat gcttgtgaag gatgcaaggg tttcttccgg agaacaatca gattgaagct 

601 tatctatgac agatgtgatc ttaactgtcg gatccacaaa aaaagtagaa ataaatgtca 

661 gtactgtcgg tttcagaaat gccttgcagt ggggatgtct cataatgcca tcaggtttgg 

721 gcggatcgca caggccgaga aggagaagct gttggcggag atctccagtg atatcgacca 

781 gctgaatcca gagtccgctg acctccgtca ggccctggca aaacatttgt atgactcata 

841 cataaagtcc ttcccgctga ccaaagcaaa ggcgagggcg atcttgacag gaaagacaac 

901 agacaaatca ccattcgtta tctatgacat gaattcctta atgatgggag aagataaaat 

961 caagttcaaa cacatcaccc ccctgcagga gcagagcaaa gaggtggcca tccgcatctt 

1021 tcagggctgc cagtttcgct ccgtggaggc tgtgcaggag atcacagagt atgccaaaag 

1081 cattcctggt tttgtaaatc ttgacttgaa cgaccaagta actctcctca aatatggagt 

1141 ccacgagatc atttacacaa tgctggcctc cttgatgaat aaagatgggg ttctcatatc 

1201 cgagggccaa ggcttcatga caagggagtt tctaaagagc ctgcgaaagc cttttggtga 

1261 ctttatggag cccaagtttg agtttgctgt gaagttcaat gcactggaat tagatgacag 

1321 cgacttggca atatttattg ctgtcattat tctcagtgga gaccgcccag gtttgctgaa 

1381 tgtgaagccc attgaagaca ttcaagacaa cctgctacaa gccctggagc tccagctgaa 

1441 gctgaaccac cctgagtcct cacagctgtt tgccaagctg ctccagaaaa tgacagacct 

1501 cagacagatt gtcacggaac acgtgcagct actgcaggtg atcaagaaga cggagacaga 

1561 catgagtctt cacccgctcc tgcaggagat ctacaaggac ttgtactagc agagagtcct 

1621 gagccactgc caacatttcc cttcttccag ttgcactatt ctgagggaaa atctgaccat 

1681 aagaaattta ctgtgaaaaa gcgttttaaa aagaaaaggg tttagaatat gatctatttt 

1741 atgcatattg tttataaaga cacatttaca atttactttt aatattaaaa attaccatat 

1801 tatgaaattg c 

FIG.9B 
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i MFQP0FEAPEVREEEEKEEVAEAE6APELNGGPQHALPSSSYTDLSRSSSPPSLLDQLQM 
fil GCPGASCGSLNMECRVCGDKASGFHYGVHACEGCKGFFRRTIRMKLEYEKCERSCKIQKK 

* nrKycrfqkclalgmshnairfgwea^^^^ 

t svhvfyrcqcttvetvreltefaksi psfsslflndqvtllkygvheai famlas I vnk 
dgllvangsgfvtreflrslrkpfsdiiepkfefavkfnalelddsdlalfiaaiilcgd 
361 rpglmnvprveaiqdtilralefhlqanhpdaqylfpkllqkmadlrqlvtehawqri 
tttsi hplloeiykdmy 



121 NRN 
181 
241 
301 DGL 



OUX im uu ..I....". — . 

421 KKTETETSLHPLLQEIYKDMY 



FIG. 1 0A 
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1 gaattctgcg gagcctgcgg gacggcggcg ggttggcccg taggcagccg ggacagtgtt 
61 gtacagtgtt ttgggcatgc acgtgatact cacacagtgg cttctgctca ccaacagatg 
121 aagacagatg caccaacgag ggtctggaat ggtctggagt ggtctggaaa gcagggtcag 
181 atacccctgg aaaactgaag cccgtggagc aatgatctct acaggactgc ttcaaggctg 
241 atgggaacca ccctgtagag gtccatctgc gttcagaccc agacgatgcc agagctatga 
301 ctgggcctgc aggtgtggcg ccgaggggag atcagccatg gagcagccac aggaggaagc 
361 ccctgaggtc cgggaagagg aggagaaaga ggaagtggca gaggcagaag gagccccaga 
421 gctcaatggg ggaccacagc atgcacttcc ttccagcagc tacacagacc tctcccggag 
481 ctcctcgcca ccctcactgc tggaccaact gcagatgggc tgtgacgggg cctcatgcgg 
541 cagcctcaac atggagtgcc gggtgtgcgg ggacaaggca tcgggcttcc actacggtgt 
601 tcatgcatgt gaggggtgca agggcttctt ccgtcgtacg atccgcatga agctggagta 
661 cgagaagtgt gagcgcagct gcaagattca gaagaagaac cgcaacaagt gccagtactg 
721 ccgcttccag aagtgcctgg cactgggcat gtcacacaac gctatccgtt ttggtcggat 
781 gccggaggct gagaagagga agctggtggc agggctgact gcaaacgagg ggagccagta 
841 caacccacag gtggccgacc tgaaggcctt ctccaagcac atctacaatg cctacctgaa 
901 aaacttcaac atgaccaaaa agaaggcccg cagcatcctc accggcaaag ccagccacac 
961 ggcgcccttt gtgatccacg acatcgagac attgtggcag gcagagaagg ggctggtgtg 
1021 gaagcagttg gtgaatggcc tgcctcccta caaggagatc agcgtgcacg tcttctaccg 
1081 ctgccagtgc accacagtgg agaccgtgcg ggagctcact gagttcgcca agagcatccc 
1141 cagcttcagc agcctcttcc tcaacgacca ggttaccctt ctcaagtatg gcgtgcacga 
1201 ggccatcttc gccatgctgg cctctatcgt caacaaggac gggctgctgg tagccaacgg 
1261 cagtggcttt gtcacccgtg agttcctgcg cagcctccgc aaacccttca gtgatatcat 
1321 tgagcctaag tttgaatttg ctgtcaagtt caacgccctg gaacttgatg acagtgacct 
1381 ggccctattc attgcggcca tcattctgtg tggagaccgg ccaggcctca tgaacgttcc 
1441 acgggtggag gctatccagg acaccatcct gcgtgccctc gaattccacc tgcaggccaa 
1501 ccaccctgat gcccagtacc tcttccccaa gctgctgcag aagatggotg acctgcggca 
1561 actggtcacc gagcacgccc agatgatgca gcggatcaag aagaccgaaa ccpagacctc 
1621 gctgcaccct ctgctccagg agatctacaa ggacatgtac taacggcggc acccaggcct 
1681 ccctgcagac tccaatgggg ccagcactgg aggggcccac ccacatgact tttccattga 
1741 ccagctctct tcctgtcttt gttgtctccc tctttctcag ttcctctttc ttttctaatt 
1801 cctgttgctc tgtttcttcc tttctgtagg tttctctctt cccttctccc ttctcccttg 
1861 ccctcccttt ctctctccta tccccacgtc tgtcctcctt tcttattctg tgagatgttt 
1921 tgtattattt caccagcagc atagaacagg acctctgctt ttgcacacct tttccccagg 
1981 agcagaagag agtgggcctg ccctctgccc catcattgca cctgcaggct taggtcctca 
2041 cttctgtctc ctgtcttcag agcaaaagac ttgagccatc caaagaaaca ctaagctctc 
2101 tgggcctggg ttccagggaa ggctaagcat ggcctggact gactgcagcc ccctatagtc 
2161 atggggtccc tgctgcaaag gacagtggca gaccccggca gtagagccga gatgcctccc 
2221 caagactgtc attgcccctc cgatcgtgag gccacccact gacccaatga tcctctccag 
2281 cagcacacct cagccccact gacacccagt gtccttccat cttcacactg gtttgccagg 
2341 ccaatgttgc tgatggcccc tccagcacac acacataagc actgaaatca ctttacctgc 
2401 aggcaccatg cacctccctt ccctccctga ggcaggtgag aacccagaga gaggggcctg 

FIG. 1 0B 



BNSOOCID: <WO 991B124A1 l_> 



WO 99/18124 



PCTVUS98/21049 



11/11 



2461 caggtgagca ggcagggctg ggccaggtct 
2521 tgggtcagcc cagcacctcg cccagtggga 
2581 ctgatgtcca tttgtcccaa tagctctact 
2641 tggccaccta gaagtctccc tgcacagcct 
2701 ccacaccgct ggtccctgcc ctcccctgct 
2761 gggccaaagc acagctgggc atgccatgtc 
2821 gggcaagggg ctggctggag tctcagagca 
2881 ggcttcctgg gtcctgcctg gtcctccctc 
2941 gaagtgccca gcccctgcct gacggnnnnn 
3001 cgctccccac ctacccagct gatgggggtt 
3061 ggacagctgc tgggacccac ctcccccctt 
3121 cccgggtctg gtgctgagga tacagctctt 
3181 atgtatattt ttgctaggag ccccagcttc 
3241 actgacgaaa ctttaaataa atgggaatta 
3301 c 



ccggggaggc aggggtcctg caggtcctgg 
gcttcccggg ataaactgag cctgttcatt 
gccctcccct tcccctttac tcagcccagc 
ctagtgtccg gggaccttgt gggaccagtc 
cccaggttga ggtgcgctca cctcagagca 
tgagcggcgc agagccctcc aggcctgcag 
cagaggtagg agaactgggg ttcaagccca 
ccaaggagcc attctatgtg actctgggtg 
nngatcactc tctgctggca ggattcttcc 
ggggtgct-tc tttcagccaa ggctatgaag 
ccccggccac atgccgcgtc cctgccccca 
ctcagtgtct gaacaatctc caaaattgaa 
ctgtgttttt aatataaata gtgtacacag 
aatatttaaa aaaaaaagcg gccgcgaatt 



FIG.10C 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 
(i) APPLICANT: Merck & Co., Inc. 

(ii) TITLE OF INVENTION: ASSAYS FOR NUCLEAR RECEPTOR 

AGONISTS AND ANTAGONISTS USING FLUORESCENCE RESONANCE 
ENERGY TRANSFER 

(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merck & Co., Inc. 

(B) STREET: P.O. Box 2000, 126 E. Lincoln Ave. 

(C) CITY: Rahway 

(D) STATE: NJ 

(E) COUNTRY: USA 

(F) ZIP: 07065-0900 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Coppola, Joseph A 

(B) REGISTRATION NUMBER: 38,413 

(C) REFERENCE/DOCKET NUMBER: 20017PCT 

(ix) TELECOMMUNICATION INFORMATION : 
{A) TELEPHONE: 732-594-6734 

(B) TELEFAX: 732-594-4720 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Ala Glu Asn Leu Leu Asp Gly Pro Pro Asn Pro Lys Arg Ala Lvs 

Leu Ser Ser Pro Gly Phe Ser Ala Asn Asp Ser Thr Asp Phe Gly Ser 

20 25 30 

Leu Phe Asp Leu Glu Asn Asp Leu Pro Asp Glu Leu He Pro Asn Gly 

35 40 45 

Gly Glu Leu Gly Leu Leu Asn Ser Gly Asn Leu Val Pro Asp Ala Ala 

50 55 60 

Ser Lys His Lys Gin Leu Ser Glu Leu Leu Arg Gly Gly Ser Gly Ser 
65 70 75 1 80 

Ser He Asn Pro Gly He Gly Asn Val Ser Ala Ser Ser Pro Val Gin 

85 90 95 

Gin Gly Leu Gly Gly Gin Ala Gin Gly Gin Pro Asn Ser Ala Asn Met 

100 105 110 

Ala Ser Leu Ser Ala Met Gly Lys Ser Pro Leu Ser Gin Gly Asp Ser 

115 120 125 

Ser Ala Pro Ser Leu Pro Lys Gin Ala Ala Ser Thr Ser Gly Pro Thr 

130 1 3 5 140 

Pro Ala Ala Ser Gin Ala Leu Asn Pro Gin Ala Gin Lys Gin Val Gly 

. 150 155 160 

Leu Ala Thr Ser Ser Pro Ala Thr Ser Gin Thr Gly Pro Gly He Cys 

165 170 175 

Met Asn Ala Asn Phe Asn Gin Thr His Pro Gly Leu Leu Asn Ser Asn 

180 185 190 

Ser Gly His Ser Leu He Asn Gin Ala Ser Gin Gly Gin Ala Gin Val 

195 200 205 

Met Asn Gly Ser Leu Gly Ala Ala Gly Arg Gly Arg Gly Ala Gly Met 

210 215 220 

Pro Tyr Pro Thr Pro Ala Met Gin Gly Ala Ser Ser Ser Val Leu Ala 

^ 230 235 240 

Glu Thr Leu Thr Gin Val Ser Pro Gin Met Thr Gly His Ala Gly Leu 

245 250 255 

Asn Thr Ala Gin Ala Gly Gly Met Ala Lys Met Gly He Thr Gly Asn 

260 265 270 

Thr Ser Pro Phe Gly Glh Pro Phe Ser Gin Ala Gly Gly Gin Pro Met 

275 280 285 

Gly Ala Thr Gly Val Asn Pro Gin Leu Ala Ser Lys Gin Ser Met Val 

290 295 3 0 o 

Asn ser Leu Pro Thr Phe Pro Thr Asp He Lys Asn Thr Ser Val Thr 

f , 310 3 15 320 

Asn Val Pro Asn Met Ser Gin Met Gin Thr Ser Val Gly He Val Pro 

325 330 335 

Thr Gin Ala lie Ala Thr Gly Pro Thr Ala Asp Pro Glu Lys Arg Lys 

3 3 45 350 

Leu He Gin Gin Gin Leu Val Leu Leu Leu His Ala His Lys Cys Gin 

355 3 «° 365 

Arg Arg Glu Gin Ala Asn Gly Glu Val Arg Ala Cys Ser Leu Pro His 

370 575 380 

Cys Arg Thr Met Lys Asn Val Leu Asn His Met Thr His Cys Gin Ala 

390 39 5 400 

Gly Lys Ala Cys Gin UU 

405 
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(2) INFORMATION FOR SEQ ID NO;2: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO :2: 



AAACTCAGCT CGCCCGGTTT CoJScSS ^ ~^ CCC CAAAAG AGCC 120 

. TTGGAAAATG ATCTTCCTGA TCA«S2t"I S ATTC *™AC 180 

AGTGGGAACC TTGTTCCAGA TGCTGCTTCC fJSi™ C CTTTTAAAC 240 

GGAGGCAGCG GCTCTACTAT CaSSS ^^T™ ^CTCTCGGA GCTTCTACGA 300 

CAGCAGGGCC TGGGTGGCCA 2SS 5222° <*" C ™ CCC «^ 360 

AGTGCCATGG GCAAGAGCCC -TCtcS GTGCTAACAT GGCCAGCCTC 420 

CAGGCAGCCA GCACCtSS 2£5£££ CAGCCCCCAG CCTGCCTAAA 480 

CAAAAGCAAG TCGGGcSS SSaSaS SESSSS ^CGCAAGCA 540 

TGCATGAATC CTAACTTTAA CCaSocca? CC^S^ ^ CAGACTCG ACCTGGTATC 600 

AGCTTAATTA ATCAGGCTTC SSSSSg 2£cS™ ^AATAGTAA CTCTGGCCAT 660 

GCTGGCAGAG GAAGGGGAGC TGGAATCTrr TCAATGGATC TCTTGGGGCT 720 

AGCAGCGTGC JS^C SSS gSttSSc" 780 

CTGAACACCG CACAGGCAGG AGGCATOGrr IL^™? AAATGACTGG TCACGOGGGA 840 

TTTGGACAGC CCTtoEtS aSSSgS TAACTGGGAA CACAAGTCCA 900 

CAGTTAGCCA GCAAACAGAG , ^ AGCCACTGG AGTCAACCCC 960 

AATACTTCAG TCACCAAG3T GCCAAATATG TCTCAGATCC AMjpaTra^ AGATATCAAG 1020 

CCCACACAAG CAATTGCAAC AGGCrrrar^ AAACATCAGT GGGAATTGTA 1080 

CAGCAGCTGG TTCTACtcS JSSS AAAAACGCAA ACTGATACAG 1140 

GAGGTTCGGG CCloSS SS£S£ SS AGCAAACGGA 1200 

ACGCATTGTC AGGCTGGGAA AGOCTGCcS AAAACGTTTT GAATCACATG 1260 

1290 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 468 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TOPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Val Asp Thr Glu Ser Pro Leu Cys Pro Le U Ser Pro Leu Glu Ala 

10 

Gly Asp Leu Glu Ser Pro Leu Ser Glu Glu Phe Leu Gin Glu Met Gly 
Asn lie Gin Glu lie Ser Gin Ser lie Gly Glu Asp Ser 11 Gly Ser 
Phe Gly Phe Glu ^ , n ^ ^ Qly ^ « ^ ^ ^ 



55 60 

-3- 
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Gly 


Ser 


Val 


He 


Thr 


Asp 


Thr 


Leu 


Ser 


Pro 


Ala 


Ser 


Ser 


Pro 


Ser 


Ser 


65 










70 










75 










80 


Val 


Thr 


Tyr 


Pro 


Val 


Val 


Pro 


Gly 


Ser 


Val 


Asp 


Glu 


Ser 


Pro 


Ser 


Gly 










85 










90 










95 


Ala 


Leu 


Asn 


He 


Glu 


Cys 


Arg 


He 


Cys 


Gly 


Asp 


Lys 


Ala 


Ser 


Gly 


Tyr 






Gly 
115 


100 










105 










110 


His 


Tyr 


Val 


His 


Ala 


Cys 


Glu 
120 


Gly 


Cys 


Lys 


Gly 


Phe 
125 


Phe 


Arg 


Arg 


Thr 


lie 


Arg 


Leu 


Lys 


Leu 


Val 


Tyr 


Asp 


Lys 


Cys 


Asp 


Arg 


Ser 


Cys 


Lys 




130 










135 










140 








lie 


Gin 


Lys 


Lys 


Asn 


Arg 


Asn 


Lys 


Cys 


Gin 


Tyr 


Cys 


Arg 


Phe 


His 


Lys 


145 










150 










155 










160 


Cys 


Leu 


Ser 


Val 


Gly 
165 


Met 


Ser 


His 


Asn 


Ala 
170 


He 


Arg 


Phe 


Gly 


Arg 
175 


Met 


Pro 


Arg 


Ser 


Glu 


Lys 


Ala 


Lys 


Leu 


Lys 


Ala 


Glu 


He 


Leu 


Thr 


Cys 


Glu 








180 










185 










190 




His 


Asp 


lie 


Glu 


Asp 


Ser 


Glu 


Thr 


Ala 


Asp 


Leu 


Lys 


Ser 


Leu 


Ala 


Lys 






195 










200 










205 






Arg 


He 


Tyr 


Glu 


Ala 


Tyr 


Leu 


Lys 


Asn 


Phe 


Asn 


Met 


Asn 


Lys 


Val 


Lys 




210 










215 










220 








Ala 


Arg 


Val 


He 


Leu 


Ser 


Gly 


Lys 


Ala 


Ser 


Asn 


Asn 


Pro 


Pro 


Phe 


Val 


225 










230 










235 










240 


He 


His 


Asp 


Met 


Glu 
245 


Thr 


Leu 


Cys 


Met 


Ala 
250 


Glu 


Lys 


Thr 


Leu 


Val 
255 


Ala 


Lys 


Leu 


Val 


Ala 


Asn 


Gly 


He 


Gin 


Asn 


Lys 


Glu 


Val 


Glu 


Val 


Arg 


He 








260 










265 










270 




Phe 


His 


Cys 
275 


Cys 


Gin 


Cys 


Thr 


Ser 
280 


Val 


Glu 


Thr 


Val 


Thr 
285 


Glu 


Leu 


Thr 


Glu 


Phe 


Ala 


Lys 


Ala 


He 


Pro 


Ala 


Phe 


Ala 


Asn 


Leu 


Asp 


Leu 


Asn 


Asp 




290 










295 










300 








Gin 


Val 


Thr 


Leu 


Leu 


Lys 


Tyr 


Gly 


Val 


Tyr 


Glu 


Ala 


He 


Phe 


Ala 


Met 


305 










310 










315 










320 


Leu 


Ser 


Ser 


Val 


Met 

325 


Asn 


Lys 


Asp 


Gly 


Met 
330 


Leu 


Val 


Ala 


Tyr 


Gly 
335 


Asn 


Gly 


Phe 


He 


Thr 


Arg 


Glu 


Phe 


Leu 


Lys 


Ser 


Leu 


Arg 


Lys 


Pro 


Phe 


Cys 








340 










345 










350 




Asp 


He 


Met 

355 


Glu 


Pro 


Lys 


Phe 


Asp 
360 


Phe 


Ala 


Met 


Lys 


Phe 
365 


Asn 


Ala 


Leu 


Glu 


Leu 


Asp 


Asp 


Ser 


Asp 


He 


Ser 


Leu 


Phe 


Val 


Ala 


Ala 


He 


He 


Cys 




370 










375 










380 








Cys 


Gly 


Asp 


Arg 


Pro 


Gly 


Leu 


Leu 


Asn 


Val 


Gly 


His 


He 


Glu 


Lys 


Met 


385 










390 










395 








400 


Gin 


Glu 


Gly 


He 


Val 
405 


His 


Val 


Leu 


Arg 


Leu 
410 


His 


Leu 


Gin 


Ser 


Asn 
415 


His 


Pro 


Asp 


Asp 


He 


Phe 


Leu 


Phe 


Pro 


Lys 


Leu 


Leu 


Gin 


Lys 


Met 


Ala 


Asp 








420 










425 










430 




Leu 


Arg 


Gin 


Leu 


Val 


Thr 


Glu 


His 


Ala 


Gin 


Leu 


Val 


Gin 


He 


He 


Lys 






435 










440 










445 






Lys 


Thr 


Glu 


Ser 


Asp 


Ala 


Ala 


Leu 


His 


Pro 


Leu 


Leu 


Gin 


Glu 


He 


Tyr 




450 










455 










460 








Arg 


Asp 


Met 


Tyr 



























465 



(2) INFORMATION FOR SEQ ID NO:4: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO:4: 

GGAGCTCGGC GGCACAACCA ^CCA^TG ^CGCGArGG ^^^c^ 
TCCCCCCTCT CCCCACTCGA GGCCGGOGAT g^J^ GCGAGGATAG TTCTGGAAGC 
CAAGAAATGG GAAACATCCA. AGAGATTTCG CAMO™£J GCTCAGATGG CTCGGTCATC 
TTTGGCTTTA CGGAATACCA CTATTTAOGA CTTATCCTGT GGTCCCCGGC 

ACGGACACGC TTTCACCAGC ^CGAGC^ SSSSr CTAGAATCTG CGGGGACAAG 
AGGGTCGACG AGTCTCCCAG TOGMCMTC SaGGCT GCAAGGGCTT CTTTCGGCGA 
GCCTCAGGCT ATCATTACGG AG^CGCG GCTCCAAGAT CCAGAAAAAG 

ACGATTCGAC TCAAGCTGGT GTATCACAAG TGCGACCO^a m:l:G ^cG G GATGTCACAC 
AACAGAAACA AATGCCAGTA TTGTOGATTT ^AA^ CAAAACTGAA AGCAGAAATT 
AACGCGATTC GTTTTGGACG AATGCCAAGA ATCTCAAATC TCTGGCCAAG 

CTTACCTGTG aacatgacat agaagattct JAAA^^ aggtcaaagc CCGGGTCATC 
AGAATCTACG AGGCCTACTT GAAGAACTTC AACATUAR ATGATATGGA GACACTGTGT 
CTCTCAGGAA AGGCCAGTAA C^TCCACCT gCATCCAGAA CAAGGAGGTG 1020 

ATGGCTGAGA AGACGCTGCT GGCCAAGOTG GTOGCCAA^ AGACCGTCAC GGAGCTCACG 1080 
GAGGTCCGCA TCTTTCACTG CTOCCACTGC ACGTCA^l^ ^ mTGACh rTG 1140 

GAATTCGCCA AGGCCATCCC AGCGTTCGCA AACTTGGA^ cTTCTGTGAT GAACAAAGAC 1200 
CTAAAATACG GAGTTTATGA G^CATATTC AATTCCTAAA AAGCOTAAGG 1260 

GGGATGCTGG TAGCGTATGG ^TGGGTTT CCATGAAGTT CAATGCACTG ■ «™ 

AAACCGTTCT GTGATATCAT GGAACCCAAG TTTGA^^ TCATTTGCTC TGGAGATCGT 
GAACTGGATG ACAGTGATAT CTCCCTTTTT SSS^T ACATGTGCTC 

CCTGGCCTTC TAAACGTAGG ACACATTGAA AAAATGCA^ ACTTCTTCAA 
AGACTCCACC TGCAGAGCAA CCACCCGGAC GATATCTTTC GATCATCA AG 
AAAATGGCAG ACCTCCGGCA GCTGGTGACG G J^%^ AGATCTACAG GGACATGTAC 
AAGACGGAGT CGGATGCTCC GCTGCACCCG GAAGCTGACA GCACTACAAA 

TGAGTTCCTT CAGATCAGCC ACACCTTTTC ACTTTAACCT TAGAGCTTGG 

GGAGACGGGG GAGCAGCACG ATTTTGCACA ^^^^ CCAGTACTTC 

TAAGAGCATA G^kAGTCAAAT G^TCGGGGAG GTGGCTAATC TCAGGACTGG GAAG 
(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 478 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Y1 nvn Pro Thr Asn Phe Gly 

Met Thr Met Val Asp Thr Glu He Ala Phe Trp Pro inr ^ 



1 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1854 
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lie 


Ser 


Ser Val Asp Leu Ser Val Met 


Glu 


Asp 


His Ser 


His 


Ser Phe 






20 25 








30 




Asp 


He 


Lys Pro Phe Thr Thr Val Asp 


Phe 


Ser 


Ser He 


Ser 


Thr Pro 






35 40 






45 






His 


Tyr 


Glu Asp He Pro Phe Thr Arg 


Thr 


Asp 


Pro Val 


Val 


Ala Asp 




50 


55 






60 






Tyr 


Lys 


Tyr Asp Leu Lys Leu Gin Glu 


Tyr 


Gin 


Ser Ala 


He 


Lys Val 


65 




70 




75 






80 


Glu 


Pro 


Ala Ser Pro Pro Tyr Tyr Ser 


Glu 


Lys 


Thr Gin 


Leu 


Tyr Asn 






85 


90 








95 


Lys 


Pro 


His Glu Glu Pro Ser Asn Ser 


Leu 


Met 


Ala He 


Glu 


Cys Arg 






100 105 








110 


Val 


Cys 


Gly Asp Lys Ala Ser Gly Phe 


His 


Tyr 


Gly Val 


His 


Ala Cys 






115 120 






125 






Glu 


Gly 


Cys Lys Gly Phe Phe Arg Arg 


Thr 


He 


Arg Leu 


Lys 


Leu He 




130 


135 






140 






Tyr 


Asp 


Arg Cys Asp Leu Asn Cys Arg 


He 


His 


Lys Lys 


Ser 


Arg Asn 


145 




150 




155 






160 


Lys 


Cys 


Gin Tyr Cys Arg Phe Gin Lys 


Cys 


Leu 


Ala Val 


Gly 


Met Ser 






165 


170 








175 


His 


Asn 


Ala He Arg Phe Gly Arg He 


Ala 


Gin 


Ala Glu 


Lys 


Glu Lys 






180 185 








190 




Leu 


Leu 


Ala Glu He Ser Ser Asp He 


Asp 


Gin 


Leu Asn 


Pro 


Glu Ser 






195 200 






205 






Ala 


Asp 


Leu Arg Gin Ala Leu Ala Lys 


His 


Leu 


Tyr Asp 


Ser 


Tyr He 




210 


215 






220 






Lys 


Ser 


Phe Pro Leu Thr Lys Ala Lys 


Ala 


Arg 


Ala He 


Leu 


Thr Gly 


225 




230 




235 






240 


Lys 


Thr 


Thr Asp Lys Ser Pro Phe Val 


He 


Tyr Asp Met 


Asn 


Ser Leu 






245 


250 








255 


Met 


Met 


Gly Glu Asp Lys He Lys Phe 


Lys 


His 


He Thr 


Pro 


Leu Gin 






260 265 








270 




Glu 


Gin 


Ser Lys Glu Val Ala He Arg 


lie 


Phe Gin Gly 


Cys 


Gin Phe 






275 280 






285 






Arg 


Ser 


Val Glu Ala Val Gin Glu He 


Thr 


Glu Tyr Ala 


Lys 


Ser He 




290 


295 






300 






Pro 


Gly 


Phe Val Asn Leu Asp Leu Asn 


Asp 


Gin Val Thr 


Leu 


Leu Lys 


305 




310 




315 






320 


Tyr 


Gly 


Val His Glu He He Tyr Thr 


Met 


Leu 


Ala Ser 


Leu 


Met Asn 






325 


330 








335 


Lys 


Asp 


Gly Val Leu He Ser Glu Gly 


Gin 


Gly 


Phe Met 


Thr 


Arg Glu 






340 345 








350 




Phe 


Leu 


Lys Ser Leu Arg Lys Pro Phe 


Gly 


Asp Phe Met 


Glu 


Pro Lys 






355 360 






365 






Phe 


Glu 


Phe Ala Val Lys Phe Asn Ala 


Leu 


Glu Leu Asp 


Asp 


Ser Asp 




370 


375 






380 






Leu 


Ala 


He Phe He Ala Val He He 


Leu 


Ser Gly Asp 


Arg 


Pro Gly 


385 




390 




395 






400 


Leu 


Leu 


Asn Val Lys Pro He Glu Asp 


He 


Gin Asp Asn 


Leu 


Leu Gin 






405 


410 








415 


Ala 


Leu 


Glu Leu Gin Leu Lys Leu Asn 


His 


Pro 


Glu Ser 


Ser 


Gin Leu 






420 • 425 








430 




Phe 


Ala 


Lys Leu Leu Gin Lys Met Thr 


Asp 


Leu Arg Gin 


He 


Val Thr 






435 440 






445 






Glu 


His 


Val Gin Leu Leu Gin Val He 


Lys 


Lys Thr Glu 


Thr 


Asp Met 




450 


455 






460 
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Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys Asp Leu Tvr 
465 470 475 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CCGACCTTAC CCCAGGCGGC CTTGACGTTG GTCTTGTCGG CAGGAGACAG CACCATGGTG 60 

GCTTCTCTCT GAGTCTGGGA ATTCCCGAGC CCGAGCCGCA GCCGCCGCCT SSStS 120 

GGTCGGCCTC GAGGACACCG GAGAGGGGCG CCACGCCGCC GTGGCCGCAG AAAtSSS 180 

GOTTGACACA GAGATCGCAT TCTGGCCCAC CAACTTTGGG ATCAGCTCCG TgSSS III 

CGTAATGGAA GACCACTCCC ACTCCTTTGA TATCAAGCCC TTCACTACTG TTGACTTCTC 300 

C ™%7™^ ACTCCACATT ACGAAGACAT TCCATICACA AGAACAGATC SSSSS III 

AGATTACAAG TATGACCTGA AACTTCAAGA GTACCAAAGT GCAATCAAAG TGGAGCCTGC 420 

ATCTCCACCT TATTATTCTG AGAAGACTCA GCTCTACAAT AAGCCTCATG AAGAGCCTTC 480 

CAACTCCCTC ATGGCAATTG AATGTCGTGT CTGTGGAGAT AAAGCTTCTG GATTTCACTA 540 

TGGAGTTCAT GCTTGTGAAG GATGCAAGGG TTTCTTCCGG AGAACAATCA GATTGAAGCT 600 

TATCTATGAC AGATGTGATC TTAACTGTCG GATCCACAAA AAAAGTAGAA ATAAATGtS 6S0 

TrrCAGAAAT GCCTTGCAGT GGGGATGTCT CATAATGCCA TCAGgSS 720 

GCGGATCGCA CAGGCCGAGA AGGAGAAGCT GTTGGCGGAG ATCTCCAGTC ATATCGACCA 780 

GCTGAATCCA GAGTCCGCTG ACCTCCGTCA GGCCCTGGCA AAACATTTGT ATGACTCATA 840 

CATAAAGTCC TTCCCGCTGA CCAAAGCAAA GGCGAGGGCG ATCTTGACAG GAAAGACAAC 900 

CCATTCGTTA TCTATGACAT GAATTCCTTA ATGATGGGAG ££2SS£ 

CAAGTTCAAA CACATCACCC CCCTGCAGGA GCAGAGCAAA GAGGTGGCCA TCCGCATCTT 1020 

TCAGGGCTGC cagtttcgct CCGT3GAGGC tgtgcaggag atcacagagt lOBO 

CATTCCTGGT TTTGTAAATC TTGACTTGAA CGACCAAGTA ACTCTCCTCA AAtJtGMg? llTo 

ATTrACACAA toctggcctc cttgatcaat aaagatgggg tSS?S 1200 

CGAGGGCCAA GGCTTCATGA CAAGGGAGTT TCTAAAGAGC CTGCGAAAGC CTTTTCGTGA 1260 

CTTTATGGAG CCCAAGTTTG AGTTTGCTGT GAAGTTCAAT GCACTCGAAT TAGATGACAG 1320 

CGACTTGGCA ATATTTATTG CTGTCATTAT TCTCAGTGGA GACCGCCCAG GTTTGCtSS lilt 

TGTGAAGCCC ATTGAAGACA TTCAAGACAA CCTGCTACAA GCCCTGGAGC TCCAGCTGAA 1440 

GCTGAACCAC CCTGAGTCCT CACAGCTGTT TGCCAAGCTG CTCCAGAAAA tSSSSS Itto 

CAGACAGATT GTCACGGAAC ACGTCCAGCT ACTGCAGGTG ATCAAGAAGA cSSa^S lllo 

CMGAGTCTT CACCCGCTCC TGCAGGAGAT CTACAAGGAC TTGTACTAGC SSStS lltl 

GAGCCACTGC CAACATTTCC CTTCTTCCAG TTGCACTATT CTCAGGGAAA ATCTGACCAT 1680 

MerM>A^r ^to^*^*^ gcgttttaaa AAGAAAAGGG TTTAGAATAT gatctatttt iilo 

ATGCATATTG TTTATAAAGA CACATTTACA ATTTACTTTT AATATTAAAA ATTACCATAT 1800 



(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 441 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met Glu Gin Pro Gin Glu Glu Ala Pro Glu Val Arg Glu Glu Glu Glu 

15 10 15 

Lys Glu Glu Val Ala Glu Ala Glu Gly Ala Pro Glu Leu Asn Gly Gly 

20 25 30 

Pro Gin His Ala Leu Pro Ser Ser Ser Tyr Thr Asp Leu Ser Arg Ser 

35 40 45 

Ser Ser Pro Pro Ser Leu Leu Asp Gin Leu Gin Met Gly Cys Asp Gly 

50 55 60 

Ala Ser Cys Gly Ser Leu Asn Met Glu Cys Arg Val Cys Gly Asp Lys 
65 70 75 80 

Ala Ser Gly Phe His Tyr Gly Val His Ala Cys Glu Gly Cys Lys Gly 

85 90 95 

Phe Phe Arg Arg Thr lie Arg Met Lys Leu Glu Tyr Glu Lys Cys Glu 

100 105 110 

Arg Ser Cys Lys lie Gin Lys Lys Asn Arg Asn Lys Cys Gin Tyr Cys 

115 120 125 

Arg Phe Gin Lys Cys Leu Ala Leu Gly Met Ser His Asn Ala lie Arg 

130 135 140 

Phe Gly Arg Met Pro Glu Ala Glu Lys Arg Lys Leu Val Ala Gly Leu 
145 150 155 160 

Thr Ala Asn Glu Gly Ser Gin Tyr Asn Pro Gin Val Ala Asp Leu Lys 

165 170 175 

Ala Phe Ser Lys His lie Tyr Asn Ala Tyr Leu Lys Asn Phe Asn Met 

180 185 190 

Thr Lys Lys Lys Ala Arg Ser lie Leu Thr Gly Lys Ala Ser His Thr 

195 200 205 

Ala Pro Phe Val lie His Asp lie Glu Thr Leu Trp Gin Ala Glu Lys 

210 215 220 

Gly Leu Val Trp Lys Gin Leu Val Asn Gly Leu Pro Pro Tyr Lys Glu 
225 230 235 240 

He Ser Val His Val Phe Tyr Arg Cys Gin Cys Thr Thr Val Glu Thr 

245 250 255 

Val Arg Glu Leu Thr Glu Phe Ala Lys Ser He Pro Ser Phe Ser Ser 

260 265 270 

Leu Phe Leu Asn Asp Gin Val Thr Leu Leu Lys Tyr Gly Val His Glu 

275 280 285 

Ala He Phe Ala Met Leu Ala Ser He Val Asn Lys Asp Gly Leu Leu 

290 295 300 

Val Ala Asn Gly Ser Gly Phe Val Thr Arg Glu Phe Leu Arg Ser Leu 
305 310 315 320 

Arg Lys Pro Phe Ser Asp He He Glu Pro Lys Phe Glu Phe Ala Val 

325 330 335 

Lys Phe Asn Ala Leu Glu Leu Asp Asp Ser Asp Leu Ala Leu Phe He 

340 345 350 

Ala Ala He He Leu Cys Gly Asp Arg Pro Gly Leu Met Asn Val Pro 

355 360 365 

Arg Val Glu Ala He Gin Asp Thr He Leu Arg Ala Leu Glu Phe His 

370 375 380 

Leu Gin Ala Asn His Pro Asp Ala Gin Tyr Leu Phe Pro Lys Leu Leu 
385 390 395 400 

Gin Lys Met Ala Asp Leu Arg Gin Leu Val Thr Glu His Ala Gin Met 
405 410 415 
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Met 



Leu 



Gin Arg lie Lys Lys Thr Glu Thr 
420 425 
Gin Glu lie Tyr Lys Asp Met Tyr 



Glu Thr Ser Leu His Pro Leu 
430 



435 440 



(2) INFORMATION FOR SEQ ID NO:8: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3301 base pairs 

(B) TOPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

GAATTCTGCG GAGCCTGCGG GACGGCGGCG GGTTGGCCCG TAGGCAGCCG GGACAGTGTT 60 

GTACAGTGTT TTGGGCATGC ACGTGATACT CACACAGTGG CTTCTGCTCA CCAACAGATG 120 

AAGACAGATG CACCAACGAG GGTCTGGAAT GGTCTGGAGT GGTCTGGAAA GCAGGGTCAG 180 

ATACCCCTGG AAAACTGAAG CCCGTGGAGC AATGATCTCT ACAGGACTGC TTCAAGGCTG 240 

ATGGGAACCA CCCTGTAGAG GTCCATCTGC GTTCAGACCC AGACGATGCC AGAGCTATGA 300 

CTGGGCCTGC AGGTGTGGCG CCGAGGGGAG ATCAGCCATC GAGCAGCCAC AGGAGGAAGC 360 

CCCTGAGGTC CGGGAAGAGG AGGAGAAAGA GGAAGTGGCA GAGGCAGAAG GAGCCCCAGA 420 

GCTCAATGGG GGACCACAGC ATGCACTTCC TTCCAGCAGC TACACAGACC TCTCCCGGAG 480 

CTCCTCGCCA CCCTCACTGC TGGACCAACT GCAGATGGGC TGTGACGGGG CCTCATGCGG 540 

CAGCCTCAAC ATGGAGTGCC GGGTGTGCGG GGACAAGGCA TCGGGCTTCC ACTACGGTGT 600 

TCATGCATGT GAGGGGTGCA AGGGCTTCTT CCGTCGTACG ATCCGCATGA AGCTGGAGTA 660 

CGAGAAGTGT GAGCGCAGCT GCAAGATTCA GAAGAAGAAC CGCAACAAGT GCCAGTACTG 720 

CCGCTTCCAG AAGTGCCTGG CACTGGGCAT GTCACACAAC GCTATCCGTT TTGGTCGGAT 780 

GCCGGAGGCT GAGAAGAGGA AGCTGGTGGC AGGGCTGACT GCAAACGAGG GGAGCCAGTA 840 

CAACCCACAG GTGGCCGACC TGAAGGCCTT CTCCAAGCAC ATCTACAATG CCTACCTGAA 900 

AAACTTCAAC ATGACCAAAA AGAAGGCCCG CAGCATCCTC ACCGGCAAAG- CCAGCCACAC 960 

GGCGCCCTTT GTGATCCACG ACATCGAGAC ATTGTGGCAG GCAGAGAAGG GGCTGGTGTG 1020 

GAAGCAGTTG GTGAATGGCC TGCCTCCCTA CAAGGAGATC AGCGTGCACG TCTTCTACCG 1080 

CTGCCAGTGC ACCACAGTGG AGACCGTGCG GGAGCTCACT GAGTTCGCCA AGAGCATCCC 1140 

CAGCTTCAGC AGCCTCTTCC TCAACGACCA GGTTACCCTT CTCAAGTATG GCGTGCACGA 1200 

GGCCATCTTC GCCATGCTGG CCTCTATCGT CAACAAGGAC GGGCTGCTGG TAGCCAACGG 1260 

CAGTGGCTTT GTCACCCGTG AGTTCCTGCG CAGCCTCCGC AAACCCTTCA GTGATATCAT 1320 

TGAGCCTAAG TTTGAATTTG CTGTCAAGTT CAACGCCCTG GAACTTGATG ACAGTGACCT 1380 

GGCCCTATTC ATTGCGGCCA TCATTCTGTG TGGAGACCGG CCAGGCCTCA TGAACGTTCC 1440 

ACGGGTGGAG GCTATCCAGG ACACCATCCT GCGTGCCCTC GAATTCCACC TGCAGGCCAA 1500 

CCACCCTGAT GCCCAGTACC TCTTCCCCAA GCTGCTGCAG AAGATGGCTG ACCTGCGGCA 1560 

ACTGGTCACC GAGCACGCCC AGATGATGCA GCGGATCAAG AAGACCGAAA CCGAGACCTC 1620 

GCTGCACCCT CTGCTCCAGG AGATCTACAA GGACATGTAC TAACGGCGGC ACCCAGGCCT 1680 

CCCTGCAGAC TCCAATGGGG CCAGCACTGG AGGGGCCCAC CCACATGACT TTTCCATTGA 1740 

CCAGCTCTCT TCCTGTCTTT GTTGTCTCCC TCTTTCTCAG TTCCTCTTTC TTTTCTAATT 1800 

CCTGTTGCTC TGTTTCTTCC TTTCTGTAGG TTTCTCTCTT CCCTTCTCCC TTCTCCCTTG 1860 

CCCTCCCTTT CTCTCTCCTA TCCCCACGTC TGTCCTCCTT TCTTATTCTG TGAGATGTTT 1920 

TGTATTATTT CACCAGCAGC ATAGAACAGG ACCTCTGCTT TTGCACACCT TTTCCCCAGG 1980 

AGCAGAAGAG AGTGGGCCTG CCCTCTGCCC CATCATTGCA CCTGCAGGCT TAGGTCCTCA 2040 

CTTCTGTCTC CTGTCTTCAG AGCAAAAGAC TTGAGCCATC CAAAGAAACA CTAAGCTCTC 2100 

TGGGCCTGGG TTCCAGGGAA GGCTAAGCAT GGCCTGGACT GACTGCAGCC CCCTATAGTC 2160 

ATGGGGTCCC TGCTGCAAAG GACAGTGGCA GACCCCGGCA GTAGAGCCGA GATGCCTCCC 2220 

CAAGACTGTC ATTGCCCCTC CGATCGTGAG GCCACCCACT GACCCAATGA TCCTCTCCAG 2280 

CAGCACACCT CAGCCCCACT GACACCCAGT GTCCTTCCAT CTTCACACTG GTTTGCCAGG 2340 
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CCAATGTTGC TGATGGCCCC TCCAGCACAC ACACATAAGC ACTGAAATCA CTTTACCTGC 2400 

AGGCACCATG CACCTCCCTT CCCTCCCTGA GGCAGGTGAG AACCCAGAGA GAGGGGCCTG 2460 

CAGGTGAGCA GGCAGGGCTG GGCCAGGTCT CCGGGGAGGC AGGGGTCCTG CAGGTCCTGG 2520 

TGGGTCAGCC CAGCACCTCG CCCAGTGGGA GCTTCCCGGG ATAAACTGAG CCTGTTCATT 2580 

CTGATGTCCA TTTCTCCCAA TAGCTCTACT GCCCTCCCCT TCCCCTTTAC TCAGCCCAGC 2640 

TGGCCACCTA GAAGTCTCCC TGCACAGCCT CTAGTGTCCG GGGACCTTGT GGGACCAGTC 2700 

CCACACCGCT GGTCCCTGCC CTCCCCTGCT CCCAGGTTGA GGTGCGCTCA CCTCAGAGCA 2760 

GGGCCAAAGC ACAGCTGGGC ATGCCATGTC TGAGCGGCGC AGAGCCCTCC AGGCCTGCAG 2820 

GGGCAAGGGG CTGGCTGGAG TCTCAGAGCA CAGAGGTAGG AGAACTGGGG TTCAAGCCCA 2880 

GGCTTCCTOG GTCCTGCCTG GTCCTCCCTC CCAAGGAGCC ATTCTATGTG ACTCTGGGTG 2940 

GAAGTGCCCA GCCCCTGCCT GACGGNNNNN NNGATCACTC TCTGCTQGCA GGATTCTTCC 3000 

CGCTCCCCAC CTACCCAGCT GATGGGGGTT GGGGTGCTTC TTTCAGCCAA GGCTATGAAG 3060 

GGACAGCTGC TGGGACCCAC CTCCCCCCTT CCCCGGCCAC ATGCCGCGTC CCTGCCCCCA 3120 

CCCGGGTCTG GTGCTGAGGA TACAGCTCTT CTCAGTGTCT GAACAATCTC CAAAATTGAA 3180 

ATGTATATTT TTGCTAGGAG CCCCAGCTTC CTGTGTTTTT AATATAAATA GTGTACACAG 3240 

ACTGACGAAA CTTTAAATAA ATGGGAATTA AATATTTAAA AAAAAAAGCG GCCGCGAATT 3300 

C 3301 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
ACTCGGATCC AAGCCATGGC TGAGAACTTG CTGGACGG 38 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
CACAAAGCTT AGGCCATGTT AGCACTGTTC GG 3: 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
CTCAGTCGAC TTATTGAATT CCACTAGCTG GAGATCC 
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